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PLANT PATHOGEN RESPONSE GENE 
CROSS-REFERENCE TO RELATED APPLICATIONS 

5 

This application claims benefit of U.S. Provisional Application No. 60/039,063 filed 
February 28, 1997. 

BACKGROUND OF THE INVENTION 

10 

Field of the Invention 

This invention relates to a novel DNA molecule that encodes a novel polypeptide, 
LSD1, which has an effect in regulating the initial response of plants to pathogens and the 
subsequent spread of plant cell death engendered by infection, the protein encoded by the 
15 gene, and transgenic plants comprising the DNA molecule. This invention also relates to 
novel DNA molecules encoding LSD1 related proteins LOL1 and LOL2. In addition, it 
relates to novel DNA molecules encoding proteins which directly interact with LSD1 . 

Description of the Related Art 

20 Controlled induction of cell death occurs during both normal plant development and 

as the rapid, localized response to pathogen infection known as the hypersensitive response 
(HR) (Stakman, 1915; Goodman and Novacky, 1994; Dangl et al., 1996). The HR is a 
feature of most, but not all, disease resistance reactions in plants. The disclosure of these 
publications and all others cited herein, as well as of the priority application, is incorporated 

25 herein by reference. 

Genetic control of disease resistance reactions is of two broad classes. The first is 
determined by specific interactions between particular alleles of pathogen avr (avirulence) 
gene loci and an allele of the corresponding plant disease resistance (if ) locus. When these 
alleles are present in both host and pathogen, the result is disease resistance in the plant, and 

30 the interaction is said to be "incompatible". If either the plant R allele or the cognate 
pathogen avr gene are absent or inactive, disease results and the interaction in said to be 
"compatible" (reviewed by Flor, 1971; Crute, 1985; Keen, 1990; Pryor and Ellis, 1993). A 
great deal of progress has been made recently in understanding the molecular structure of R 
genes and their predicted products (reviewed by Dangl, 1995; Staskawicz et al., 1995; Bent, 

35 1996). These molecules function to recognize avr dependent signals and trigger the plant 
cell to begin the chain of signal transduction events culminating in a halt of pathogen 
growth. The simplest mechanistic interpretation of allele-specific disease resistance is that 
the R gene product recognizes the avr gene product directly. Although no direct avr-R 
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protein interaction has been shown in planta, expression of avr genes in plant cells can be 
sufficient to trigger the HR in a tf-dependent manner, and avr-R protein-protein interactions 
can occur in yeast two-hybrid systems (Gopalan et al., 1996; Scofield et al., 1996- Tane et 
al., 1996). 

The second mode of genetic control of disease resistance is termed "non-host" 
resistance and describes in essence those interactions which lack genetic variability in either 
host or pathogen such that no virulent pathogen and no susceptible host line have been 
identified. While it is not beyond reason to assume that traditional "non-host" interactions 
are simply a series of allele specific recognition events occurring simultaneously (Whalen et 
al., 1988; Kobayashi et al., 1989; Valent et al., 1990), it is also possible that this mode of 
resistance is mechanistically distinct from that mediated by allele-specific interactions 
Pathogen ligands (termed elicitors) which mediate several key non-host interactions have 
been isolated, although their corresponding plant receptors have not (Cosio, et al. 1992; 
Nurnberger et al., 1994). 

Subsequent to pathogen recognition by either of these two systems, the plant cell 
deploys a battery of inducible defense responses. Chief among the earliest events are 
calcium influx, K -H exchange leading to alkalinization of the extracellular space, and an 
oxidative burst (reviewed in Godiard et al., 1 994; Hammond-Kosack and Jones, 1996). The 
latter is potentially mediated by a plasma membrane NADPH oxidase analogous to that 
used by mammalian neutrophils (Low and Merida, 1996), although other models exist 
(Bolwell et al.,^995). Parts of this cascade are linearly regulated in at least some systems: 
blocking of Ca influx blocks anion channel activity, the oxidative burst and downstream 
events including cell death; blocking anion channels effects only ROI production and 
defense gene activation, but not Ca + influx (Nurnberger et al., 1994; Levine et al 1996- 
May et al., 1996). 

Consequent production of reactive oxygen intermediates (ROI) occurs with kinetics 
and magnitude suggesting a key role in either pathogen ehmination, subsequent signaling of 
downstream effector functions, or both (reviewed by Baker and Orlandi, 1995; Low and 
Merida, 1996). H202 can have a key role in resistance responses, and' cell wall 
strengthening (Brisson et al., 1994; Levine et al., 1994; Levine et al., 1996), and superoxide 
produced as the proximal ROI in the burst has also been implicated in initiating HR (Doke, 
1983; Jabs et al., 1996). Transcription and translation of plant genes are required for HR. 
These signals are thought to culminate in transcriptional activation of a variety of plant 
genes, HR, and the production of both local and systemic signals that protect the plant from 
further infection. It is unclear whether these effector functions are controlled by linear, 
interdigitating, or bifurcating signal pathways. 

Cell death during the HR may be a direct consequence of ROI toxicity, or it may be 
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a secondary consequence of signals derived from ROI. It is not known whether HR is 
required to halt pathogen growth. Nonetheless, HR is correlated with the onset of systemic 
acquired resistance (SAR) to secondary infection in distal tissue (reviewed by Ryals et al., 
1996). In at least tobacco and Arabidopsis, enzymatic blocking of salicylic acid (SA) 

5 accumulation subsequent to infection alters disease resistance responses, and SA in distal 
tissues is required for SAR (Gafibey et al., 1993; Delaney et al., 1994; Vernooij et al. s 
1994). SA accumulates following the oxidative burst to high levels locally at infection 
sites. The biochemical properties of S A as an inhibitor of a variety of enzymes suggest a 
model whereby SA or a radical derived from it poisons the infected cell, causing its death 

10 (Enyedi et al., 1992; Malamy et al., 1992; Chen et al., 1994; Durner and Klessig, 1995; 
Rueffler et al., 1995). Recent descriptions of the morphology of cell death during infection 
suggest, in at least some cases, parallels with animal apoptosis (Mittler et al., 1995; Kosslak 
et al., 1996; Levine et al., 1996; Ryerson and Heath, 1996; Wang et al, 1996a; reviewed by 
Dangl et al., 1996). A molecular understanding of both the signaling events that control the 

15 onset of this specialized plant cell death and the mechanisms by which these cells die will 
hasten approaches to manipulate cell death to protect plants from disease. 

A number of researchers have isolated mutants in Arabidopsis which exhibit 
constitutive onset of HR-like cell death in the absence of pathogen (Greenberg and Ausubel, 
1993; Dietrich et al., 1994; Greenberg et al., 1994). These mutants resemble a variety of 

20 mutants in crop species isolated since the 1920s and broadly categorized as "lesion mimic 
mutations 0 (Langford, 1948; Kiyosawa, 1970; Walbot et al., 1983; Johal et al., 1994). A 
series of non-allelic mutations was isolated which expressed histochemical and molecular 
markers associated with disease resistance responses. These mutants subdivide the lesion 
mimic class into a "lesions simulating disease resistance" or Isd phenotype (Dietrich et al., 

25 1994). These mutants also exhibited heightened resistance to otherwise virulent bacterial 
and oomycete pathogens when lesions were present, demonstrating that these cell death 
phenotypes can trigger pathogen non-specific resistance resembling SAR. Similar 
"accelerated cell death" or acd mutants have been described by Greenberg and Ausubel 
(Greenberg et al., 1994). Greenberg and Ausubel (1993) additionally isolated a mutant 

30 which though expressing an acd phenotype was in fact more susceptible to pathogen. It is 
thus possible to identify genetically at least two types of cell death, namely those which 
feed into a pathway culminating in establishment of a disease resistant state, and those 
which do not 

The Isdl mutant is exceptional. In conditions permissive for wild type plant growth 
35 and in the absence of detectable microscopic lesions, the Isdl mutant is hyper-responsive to 
challenge by a variety of stimuli including pathogens and low doses of chemicals which 
trigger the onset of SAR (Dietrich et al., 1994). Mutant Isdl plants are resistant to 
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otherwise virulent pathogens in conditions where no spontaneous cell death lesions form. 
Following initiation of cell death in a local spot on a leaf, lesions propagate throughout the 
leaf and kill it 2-4 days later. Propagation of locally initiated cell death is confined to the 
inoculated leaf. Thus, LSD1 functions to negatively regulate both the initial response to 
pathogens and the subsequent spread of cell deatfi. Superoxide is a necessary and sufficient 
trigger for this phenotype, and superoxide production precedes onset of cell death by 8-16 
hours following initiation by three different triggers (Jabs et al., 1996). Therefore, the LSD 1 
gene responds to either superoxide or to a signal derived from it to down regulate or 
dampen the cell death response, resulting in the typical locally bounded HR. The invention 
herein includes the LSD1 gene, which encodes the first member of a new subclass of zinc- 
finger proteins in Arabidopsis. 

It is therefore an object of the invention to provide a novel DNA molecule, LSD1, 
isolated from Arabidopsis which works to protect plant cells in response to pathogens, and 
DNA molecules encoding LSD1 related proteins LOL1 and LOL2. 

It is a further object of the invention to provide the protein encoded by LSD 1, and 
transgenic plants comprising LSD1. Knowledge of the structure of the LSD1 gene allows 
accurate creation of particular mutants (e.g., deletion and point mutations), for example, 
mutants having a dominant negative phenotype, analogous to the mutants of Drosophila 
PANNIER gene (Ramain et al., 1993), using methods known in the art. This in turn allows 
engineering of transgenic crop plants which do not suffer cell death, but are still resistant to 
infection. In addition, expression of the dominant negative LSD1 protein may be refined so 
that it is expressed very quickly after infection. 

The LSD1 protein is also a useful target for herbicide development. Transgenic 
plants may be made in which LSD1 mutant genes are expressed which are resistant to 
herbicidal compounds which normally result in cell death in combination with the wild-type 
LSD1. Mutants of the LSD1 gene are tested in a Isdl background to determine if the mutant 
has a normal or novel function, and in a wild-type background to determine the existence of 
a dominant negative function. 

Other objects and advantages will be more fully apparent from the following 
disclosure and appended claims. 

SUMMARY OF THE INVENTION 
The invention herein comprises the DNA molecule of the wild-type LSD 1, which 
functions to monitor levels of a superoxide-dependent signal and negatively regulates a 
plant cell death pathway. The predicted LSD1 protein contains three zinc-finger domains, 
defined by CxxCxRxxLMYxxGASxVxCxxC (SEQ ID NO:54). The invention further 
comprises a protein encoded by LSD1, and transgenic plants comprising LSD1, and 
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mutations thereof. 

In particular, the preferred embodiments of the invention herein include the 
following: an isolated DNA molecule, encoding the LSD1 polypeptide sequence, selected 
from the group consisting of SEQ ID NOS:13-15; the LSD1 DNA molecule having the 
5 nucleotide sequence as set forth in SEQ ID NO:13; the DNA molecule that is cDNA; the 
DNA molecule which is genomic DNA; a chimeric construction comprising a promoter 
sequence and the LSD1 DNA molecule or portions of the LSD1 DNA molecule; a 
recombinant plant transformed with the LSD1 DNA molecule; a transformed plant 
comprising a DNA molecule encoding a protein as set out in SEQ ID NO: 16 or SEQ ID 

10 NO: 17; an isolated protein molecule comprising the protein set out in SEQ ID NO: 16 or 
SEQ ID NO: 17; a transformation vector comprising a LSD1 DNA molecule as set forth 
herein; an isolated DNA molecule encoding the zinc finger consensus sequence shown in 
SEQ ID NOS: 1-3; and anything that hybridizes to the LSD1 DNA molecule set forth 
herein under hybridization conditions as defined herein. 

15 Other objects and features of the inventions will be more fully apparent from the 

following disclosure and appended claims. 

BRIEF DESCRIPTION OF THE DRAWINflS 
Figures 1A-C show the physical delineation of the Isdl mutation. Figure 1A shows 

20 YAC clones at Isdl. The arrowheads imply the YAC clone extending in the direction given, 
solid vertical black bars denote YAC ends used to isolate genomic phage clones and 
subsequently converted into CAPS RFLP markers as described (refer to Figure 2 for their 
map position and to Tables 1 and 2, Examples II and in, for their definition). Figure IB 
shows the three BAC clones which contained the CAPS markers listed above BAC1G5. 

25 The arrowheads imply extension of the BAC clone in the direction shown. The scale in 
Figures 1 A and IB are the same. Figure 1C shows the genomic phage clones positioned 
under an expansion of three of the BACs. The diamond-filled bar represents the 8A6-1.3 
clone, which co-segregated with Isdl, used to isolate these phage. The Isdl deletion is 
noted at the bottom. 

30 Figure 2 is a genetic linkage map of the Isdl region. The vertical line at the left 

represents the section of Arabidopsis chromosome 4 between CH42 and B9-1.8 (telomeric 
toward bottom). CAPS-based RFLP markers discussed in the text intersect the 
chromosome, and their relative recombination frequencies in the F 2 mapping population are 
placed in the center. The number of meioses identified among the total number of F 2 's 

35 scored is at the right. The arrowhead denotes the co-segregating marker. 

Figures 3A-C show molecular fine mapping of the Isdl locus. Figures 3 A and 3B 
show genomic DNA blots demonstrating the presence of a 0.8 kb deletion om the Isdl 
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mutant Genomic DNA (5 g) from wild type Ws-0 or Isdl was digested with (for each pair 
of lanes from left to right) EcoRI, Hindlll, a double digest of Hindlll and Xbal. or Kpnl In 
Figure 3A, the blot was probed with the 0.8 kb EcoRI-Xbal. In Figure 3B, a duplicate blot 
was probed with the 4.5 kb Pstl-Xhol fragment The probes are depicted in Figure 3C and 
were .»lated fro m phage clones depicted in Figure 1C. Molecular weight markers are the 
Gibco-BRl 1 kb ladder. Figure 3C shows the restriction map in and around the tetfgene 
The extent of the deletion of this locus is shown as are the extent of the hybridization of the 
various restriction fragments with Isdl cDNAs. Genomic restriction fragments used in 
complementation experiments are underlined. The asterisk refers to an Xhol site derived 
from the phage lambda cloning junction. 

Figure 4 shows that the Isdl mutation is an mRNA null allele. RNA blots (1 g of 

L f, A) fr ° m ,SaftiSSUe ° f 5 WCek ° ld ^ kCpt m Sh ° rt (P™ve for Isdl 
grow*) 3 days after spraying with either INA (0.3 mg/ml powder containing 25% active 
mgrement or 4 mM), or wettable powder control. Spreading Isdl lesions had just started to 
appear at the time of leaf harvest. Probes were purified inserts from the LSD1 cDNA as 
represented by EST 82D11T7 (top), a PR-1 cDNA (Uknes et al, 1993b), and an acun 
cDNA. The blot was probed successively in the order displayed. 

Figure 5 shows the zinc finger domains (SEQ ID NOS.1-3) of the predicted LSD1 
protein and the alignment of the three zinc finger domains. The numbers at the left and 
right refer to amino acid residue position in the deduced LSD1 protein. Vertical lines 
indicate conservation in pairwise comparison, and a colon indicates conservative 
substitution A consensus sequence is listed below, with conservative substitutions noted in 
the second line of consensus where « + » is basic, plus charged; and "@" is amide, polar 
uncharged, hydrophilic. 

Figure 6 shows how the carboxyl portion of the deduced LSD1 protein is related to 
known DNA-binding and transcription factors. Vertical lines indicate conservation in 
pairwise comparison, and a colon indicates conservative substitution. Figure 6A shows 
homology of a slightly longer portion of the deduced LSD1 protein with mammalian insulin 
receptor substrate proteins. The LSD1 translation product (SEQ ID NO:4) is shown on the 
top, ahgned with the mouse insulin receptor substrate (SEQ ID NO:5). In this region all 
mammalian insulin receptor substrates are identical. Figure 6B shows the homology of 
LSD1, on each top line, with four known transcription factors. The LSD1 translation 
Product (SEQ ID NO:6) is shown on top, and below it are the related domainT^ a 
human early growth response (EGR) Zn-finger protein (SEQ ID NO:7, a human TGF-early 

^omvTr Tf, (SBQ 10 N0:8X a Xen ° pus laevis H " L - H ***** f -or 

(SEQ ID NO:9), and the human ELK-1 protein (SEQ ID NO:10). Figure 6C shows the 
homology of a LSD1 transcription product (SEQ ID NO:ll) with a putative maize 
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transcription initiator binding protein (SEQ ID NO: 12). GenBank accession numbers of 
each protein are listed at the right. 

Figure 7 shows the consensus sequence of the zinc finger domains (SEQ ID 
NOS:63-65, respectively) of LSD1 (A), LOL1 (B) and LOL2 (C). 

Figure 8 shows the homologies between the first (A), second (B) and third (C) zinc 
finger domains of LSD1, LOL1 and LOL2 



DETAILED DESCRIPTION OF THE INVENTION AND PREFERRED 

1° EMBODIMENTS THEREOF 

The present invention provides a genomic DNA sequence (SEQ ID NO: 13) and a 
cDNA sequence (SEQ ID NOS:14-15) or the LSD1 gene which is required for the 
regulation of initial plant response to pathogens, and cDNA proteins deduced (from short 
form, MG7-SEQ ID NO:16; from long form, MG, SEQ ID NO:17). 

15 In addition, the invention herein provides functional protein domain sequences 

involved in regulating genes controlling cell death. Gene expression can be regulated by 
attaching a promoter to the LSD1 gene, which may be either the native promoter or any 
other promoter. 

The invention herein includes the DNA molecule having the nucleotide sequence as 

20 set forth in SEQ ID NOS:13, 14 and 15, encoding either of two LSD1 polypeptides, which 
are preferably the LSDI polypeptides set forth in SEQ ID NOS:16 and 17. This DNA 
molecule may be cDNA or genomic. The invention also includes as the open reading frame 
any chimeric construction comprising a promoter sequence and the DNA molecule of the 
invention, a recombinant plant transformed with the DNA molecule, and any transformation 

25 vector comprising the DNA of the invention. In addition, the DNA sequence of either the 
full-length SEQ ID NO:13, or a shortened or otherwise modified version thereof, may be 
modified to optimize its expression in plants, with codons chosen for production of the 
same or a similar protein as encoded by the wild type LSDI gene. Other modifications of 
the LSDI gene that yield a protein having essentially the same properties as the LSD J gene 

30 are included within the invention herein. 

The invention herein also includes anything that hybridizes to the LSD1 DNA (SEQ 
ID NO: 13) of the invention as discussed above, under hybridization conditions, which are 
defined as: 7% Na dodecyl sulfate (SDS), 0.5 M sodium phosphate, pH 7.0, 1 mM EDTA at 
50C, and wash in 2X SSC buffer, 1% SDS, at 50C (Church and Gilbert, 1984). Proc. Natl. 

35 Acad. Sci. USA 81:1991-1995 (1984)). 

The novel LSD1 gene of the present invention, it its wild type form or as mutated by 
selected mutations and genetically engineered derivatives obtained as is known in the art, 
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and proteins encoded thereby, are included in the invention herein, and may be transferred 
into any plant host using methodology known in the art for purposes of altering the extent 
and type of plant resistance to pathogens, and to change resistance to particular herbicides. 

The mutant phenotype of the null Isdl allele suggests that the wild type product is a 
negative regulator of cell death. In addition, Isdl reacts to both nominally virulent 
pathogens, and to chemicals which trigger the onset SAR, with an HR-like response. But it 
is important to note that Isdl expresses wild type timing of R gene driven HR (Dietrich et 
al., 1994)~it is the subsequent spread of cell death which distinguishes the mutant. Thus, 
cell autonomous signals required for R gene function are intact in an Isdl null, but the' 
response to cell non-autonomous signals emanating from cells undergoing HR is perturbed. 
Collectively, these features of the mutant phenotype suggest that LSD1 functions to limit 
both the initiation of defense responses and the subsequent extent of the HR. The fact that 
an Isdl null is hyper-responsive to signals initiating the defense response and HR-like cell 
death additionally suggests that these paHiways are functionally intact in the wild type cell, 
but require a threshold level of signal for full activation. 

LSD1 appears to act as a transcription factor (or as a protein which sequesters a 
transcription factor). As outlined above, the oxidative burst in an infected cell generates a 
superoxide-dependent signal up-regulating the HR pathway. This signal overcomes the 
negative regulatory function of the available LSD1, and drives primary responding cells into 
the HR pathway. Additionally, the cells undergoing HR amplify the signal, probably via a 
sustained extracellular oxidative burst, to neighboring cells. The primary signal molecule 
may be diffusible over short ranges (Levine et al., 1994), could act as an autocrine signal, 
and could lead to the accumulation of a secondary signal molecule in a steep spatial gradiem 
from the infection site. At a critical point in the signal gradient, a threshold is reached. 
Above that point the pro-death pathway operates, and below it the pro-death response would 
be attenuated by LSD1. Such a gradient is formed by SA and SA-conjugates (Enyedi et al., 
1992); SA biosynthesis can be induced by hydrogen peroxide (Leon et al., 1995); and sub- 
effective doses of SA can amplify pathogen-derived signals (Kauss et al., 1992; Kauss and 
Jeblick, 1995; Mauch-Mani and Slusarenko, 1996). Thus, it could be that an SA gradient 
dictates LSD 1 activity. 

Constitutive expression levels by LSD1 could suffice to protect cells below the 
critical signal threshold for death induction. The time lag of 12-16 hours observed between 
superoxide production initiated in Isdl by a variety of triggers and the onset of cell death 
(Jabs et al., 1996), which could provide sufficient time for up-regulation of LSD1 activity 
before irrevocable commitment to death during wild type responses, so that cell death could 
spread until sufficient active LSD1 accumulates. Alternatively, this time lag could represent 
a requirement for biosynthesis of pro-death intermediates and LSD1 normally could operate 
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by interdicting this pathway. LSD1 could positively regulate anti-cell death targets, 
potentially including genes involved in cell survival, ROI detoxification, or in degradation 
of a key intermediate in the pro-death pathway. Alternatively, LSD1 could act as a 
transcriptional repressor directly on genes in the pro-death effector pathway. This scenario 

5 differs from the first only in that the set of target genes would be different. The availability 
of extragenic suppressors of Isdl will aid in identifying LSD1 targets (Jabs et al., 1996). 

This model also explains the runaway cell death phenotype of the null Isdl mutant. 
In the absence of LSD1, the threshold normally required before commitment to HR is 
removed. Thus, minimal up-regulation of the superoxide-dependent signal drives the cell 

10 into the HR pathway. Hence the ability of Isdl to respond to virulent pathogens as if 
resistant derives from a lack of background inhibition of the HR pathway normally 
operating in the cell. Moreover, extracellular superoxide produced during the oxidative 
burst initiates the same series of events in cells immediately surrounding the site of 
initiation, and the cell death propagation indicative of the Isdl phenotype results. Because 

15 the null Isdl mutant still requires superoxide for initiation of cell death propagation, it is 
unlikely that superoxide directly regulates LSD1 activity. This further suggests that a 
superoxide-dependent signal is the autocrine which propagates the response to neighboring 
cells. 

The A. thaliana Isdl mutant phenotype is characterized by enhanced disease 

20 resistance, spontaneous formation of lesions in the absence of cell death initiators and 
failure to limit the extent of cell death. The wildtype LSD1 protein therefore negatively 
regulates a cell death pathway involved in plant defense responses. 

The LSD1 gene encodes a protein containing a novel zinc finger protein, which is 
included in the invention herein and is defined by its three consensus zinc fingers: 

25 CxxCRxxLMYxxGASxRxVxCxxC (SEQ ID NO:52). These three zinc finger domains 
have not been observed before in the range of zinc finger proteins. As shown in Dietrich et 
al., 1997, the LSD1 gene is a key negative regulator of hypersensitive cell death in plants. 
We sought other versions of this consensus zinc finger sequence in other plant proteins. 

The data on homologies between the LSD1 and LOL1 and LOL2 zinc finger 

30 domains indicates that LSD1 as well as LOL1 and LOL2 are members of a novel subclass 
of zinc finger proteins that are involved in plant cell death pathways. LOL1 and LOL2 
might function in cell death phenomena leading to hypersensitive response and disease 
resistance as has been shown for LSD1. The homologues may also be involved in 
programmed cell death (PCD) pathways occurring in plants. Examples of PCD n plants 

35 include lateral root development, tracheary element differentiation, and abscission of leafs. 
Preliminary expression studies suggest that LOL2 is expressed in flowers and siliques. 
Thus a role for LOL2 in PCD pathways leading to petal senescence, anther dihiscence or 
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PCD of nucellar cells is not unlikely. It is also possible that LOL2 is involved in the 
hypersensitive response and disease resistance in flowers, thus protecting seeds and 
ultimately the following generations from pathogen. Alternatively, LOL2 could be up- 
regulated during the hypersensitive response. Use of LOL1 and LOL2 should allow 
prediction of the protein's function with respect to protection from programmed cell death. 

The consensus sequences defined by the LSD1, LOL1 and 10L2 zinc finger domains 
(Figures 7-8) are thus far unique in the available deduced protein databases. Because zinc 
finger domains of this type bind DNA and thereby regulate gene activation, it is highly 
likely that the consensus zinc finger domain defined here is required for proper regulation of 
related sets of genes. Furthermore, because zinc finger DNA binding domains of related 
sequence generally control related cellular processes, the new consensus defined here 
should also do so. Because LSD1 is known to negatively regulate cell death induced by 
pathogens, it is highly likely that LOL1 and LOL2 also control plant cell death. Thus the 
utility of this portion of the invention lies in production of transgenic plants which have 
mutated versions of the LOU or LOL2 genes or which overexpress these proteins. Such 
plants will likely be more resistant to pathogen attack, if, in the first case, the LOL genes 
function to repress defense response (as does LSD1). Alternatively, if the LOL genes 
function to activate defense mechanisms, then overexpression will lead to a more effective 
pathogen response. Because zinc finger proteins featuring other non-LSDl type DNA 
binding domains function to either activate or repress gene transcription, we cannot 
distinguish at present between these two models. 

The invention also includes plant proteins, and the genes which encode them which 
directly interact with LSD1 protein. Gene regulation in response to pathogen attack is 
controlled, in part, by the repression and activation of genes. The LSD1, LOL1 and LOL2 
proteins encode a novel branch of the zinc-finger DNA binding protein superfamily with 
roles in controlling plant cell death. As such, they are expected to interact with other 
proteins. Paradigms of gene activation currently demonstrate that DNA binding proteins 
can have two classes of "partners". The first class sequesters the DNA binding protein in 
the cell's cytosol. These partner proteins hold the DNA binding protein out of the nucleus 
until the correct cellular stimulus is received. THis stimulus disrupts the physical 
interaction, and the DNA binding protein is free to migrate into the nucleus and activate or 
repress transcription. The second class of protein which interacts with DNA binding protein 
is made up of proteins which are partners having the role of "enhancing" the gene activation 
or repression encoded by the DNA binding protein. These partners are termed "co- 
activators" or "co-repressors" and they may or may not have intrinsic DNA binding activity. 
We have identified several genes whose protein products interact physically with the LSD1 
protein using a common assay, called a "yeast two-hybrid interaction trap" to detect such 
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interactions genetically (Fields and Sternglanz, 1994; Finley and Brent, 1996). Because the 
inactivation of LSD1 by mutation leads to enhanced disease resistance, the LSD1 partner 
proteins represent novel targets for engineering plants with enhanced resistance to 
pathogens. Thus, this invention includes all proteins which interact with the cell death 
5 regulator LSD1 (SEQ ID NOS: 66-91 (includes sequential pairs of nucleic acids and 
corresponding amino acid sequences). 



10 The features of the present invention will be more clearly understood by reference to 

the following examples, which are not to be construed as limiting the invention. 

EXAMPLES 
Example I Care and maintenance of plants 

15 Plants were grown in a chamber at 9 hours light per day, 22°C day temperature and 

20°C night temperature essentially as described (Dietrich et al., 1994). 

Example II Isolation of DNA and RNA, probe preparation, cloning 

Small scale genomic DNA preps were made from single leaves (~lcm long rosette 

20 leaves) (Lukowitz et al., 1 996). The DNA pellet was re-suspended in 50 ml of Tris/EDTA 
(TE) and 1 ml was used in a 20 ml polymerase chain reaction (PCR). Large scale genomic 
DNA preps were done based on the protocol of (Rogers and Bendich, 1985), modified such 
that concentration in the 2X hexadecyltrimethylarnmonium bromide (CTAB)(Sigma, St. 
Louis, MO) buffer was increased to 3% and the precipitated DNA was resuspended in 

25 Tris/EDTA/sodium chloride (TEN) buffer and digested with 100 mg/ml, followed by two 
extractions with chloroform/iso-amyl alcohol and a final precipitation. 

RNA was isolated by grinding fresh tissue in liquid nitrogen to a fine powder and 
extraction in 1 ml of Trizol reagent (Gibco-BRL, Gaithersburg, MD) per 100 mg tissue 
fresh weight RNA was isolated according to the manufacturer's protocol. PolyA+ RNA 

30 was isolated using DynaBeads (Dynal, Oslo, Norway). RNA blots were formaldehyde 
agarose gels and contained either 15 mg total RNA or 1 mg polyA+ RNA. HyBond filters 
for DNA or RNA blots (Amersham, Little Chalfort, United Kingdom) were hybridized in 
6xSSC, 5X Denhardt's solution, 0.1% SDS and 100 mg/ml sheared Herring sperm DNA at 
65°C. Washes were in 0.2X SSC, 0.1% SDS at the same temperature. RNA blots were 

35 stripped for re-hybridization in 5 mM TRIS/2mM EDTA, (pH8.0), 0.1X Denhardt's 
solution for 1 hour at 65°C. 
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Example m Isolation of new CAPS markers and genetic mapping of Isdl 

After establishing linkage to the agamous (AG) co-dominant amplified polymorphic 
sequences (CAPS) marker (Konieczny and Ausubel, 1993), we subcloned and end- 
sequenced a 1.6 kb Hindlll fragment from the RFLP cosmid marker g3883 (position 73 5 
on the Arabidopsis RI map; Lister and Dean, 1993- see 
http:/W.nott.ac.uk^^Wtop_frame.html) > and primers designed based 'on this 
sequence. This primer set amplified a rapid amplified polymorphic DNA (RAPD) marker 
(size difference in Ws-0 versus Col-0 without restriction digestion), and map data generated 
using this primer allowed us to place Isdl below (telomeric to) it. Probe B9-1.8, isolated as 
a 1.8 kb Sstl-EcoRI fragment from the JGB9 genomic phage clone (RI map position -75- 
gift of Dr. George Coupland, Cambridge Laboratories, Norwich U.K.) was converted into a 
CAPS marker. Mapping of this polymorphism placed Isdl above (centromeric to) it (Fig 2) 
Recombinants were identified as homozygous for one of these CAPS markers and 
heterozygous for the other using DNA from F2 individuals. F3 progeny from these 
recombinants were then scored as either homozygous Isdl, segregating Isdl, or homozygous 
wild-type for lesion spread. All CAPS markers we developed are described in Table 1 
(below). 

Table L New PCR based RFLP fCAPS^ marker. H.riv.H d uring cloning »f 

Marker 
ch42 

g3883-1.6 
gl3838-1.4 



B9-1.8 



1H1L-1.6 



5F7R-1.5 



Enzvme 


PCRorod. 


Col-0 


Ws-0 


Clal 


1.4 kb 


750 bp 


1.4 kb 






650 




none 




1.4 kb 


0.7 kb 






(uncut) 


(uncut) 


Hinfl 


1.4 kb 


450 bp 


450 bp 






330 


330 






280 


280 






200 


160 


Hinfl 


1.8 kb 


420 bp 


420 bp 






260 


260 






240 








180 


180 








160 






140 


140 


Ddel 


1.6 kb 


1.0 kb 


700 bp 






300 bp 


300 








(doublet?) 


NlalV 


1.5 kb 


1.0 kb 


1.2 kb 
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250 bp 250 bp 
200 bp 

20B4-1.6 Ddel 1.6 kb 900 bp 700 bp 

400 400 
220 

180 180 

8A6-1.3 Taql 1.3 kb 800 bp 800 bp 

400 250 

220 150 



Example IV Map refinement 

YACs were defined (Schmidt et al., 1995; Schmidt et al., 1996, http://genome- 
www.stanford.edu/Arabidopsis/JIC-contigs.html), confirmed by DNA blotting to establish a 
contig and their ends were isolated by vectorette PCR as described (Matallana et al., 1992; 
Grant et al., 1995). These ends were also used to isolate genomic phage from a Ws-0 
genomic library (Fig. 1). Insert fragments of 1-3 kb were cloned into PBS and end 
sequenced for derivation of primers identifying new CAPS. PCR conditions (DNA Engine 
MJ Research) for all CAPS primer pairs except 8A6-1.3 and Isdl deletion primers are: 
92°C, 3'; 35 cycles of (denature 92°C, 30"; anneal 50°C, 30"; extend 72°C, 2'30"); 72°C, 3'. 
For 8A6-1.3 and the Isdl deletion primer pairs we used 53°C annealing. Table 2 shows the 
primer sequences used to identify new CAPS markers. 

Table 2. Primer sequences used to identify new CAPS markers used for clo nin p hdl 



ch42 for 


5'-cag tgg ate ttt cct cag acg-3' (SEQ ID NO:l 8) 


ch42 rev 


5'-cat ctt ctt ctg caa tct ggg-3' (SEQ ID NO:19) 


g3883-1.6for 


5'-cat cca tea aac aaa etc c-3' (SEQ ID NO:20) 


g3883-1.6rev 


5'-tgt ttc aga gta gec aat te-3' (SEQ ID NO:21) 


gl3 138-1. 4 for 


5'-cac gtt agt tag tta gaa gg-3' (SEQ ID NO:22) 


gl3138-1.4rev 


5'-ctg atg ttc tct aca aat gg-3' (SEQ ID NO:23) 


B9-1.8for 


5'-cgt ate cgc art tct tea ctg c-3' (SEQ ID NO:24) 


B9-1.8rev 


5'-cat ctg caa cat ctt ccc cag-3' (SEQ ID NO:25) 


1H1L-I.6fbr 


5'-ttg agt cct tct tgt ctg-3' (SEQ ID NO:26) 
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1H1L-I.6rev 


5'. 


-eta gag ctt gaa agt tga tg-3 ' (SEQ ID NO:27) 


5F7R-1.5 for 


5'. 


-gaa tgg tgt aac caa act c-3' (SEQ ID NO:28) 


5F7R-1.5rev 


5'. 


-cat acc gta tga tgg aac-3' (SEQ ID NO:29) 


20B4L-1.6for 


5'- 


-gaa etc att gta tgg acc-3' (SEQ ID NO:30) 


20B4L-1.6rev 


5'- 


-eta aga tgg gaa tgt tgg-3' (SEQ ID NO:31) 


8A6-1.3for 


5'- 


•cca aga aga gaa aac gga ga-3' (SEQ ID NO:32) 


8A6-1.3rev 


5'- 


•aac aat agg agg tgc aga gt-3' (SEQ ID NO:33) 



Primers to amplify across the Isdl deletion: 
Isdl far side: 5'-acc taa caa aaa gaa aag tgt gtg agg-3 ' (SEQ ID NO:34) 

Isdl outside 5'-ata ata aac cct act age tot aac aag-3' (SEQ ID NO:35) 

hdl alt, spl. 5' 5'-ctg eta ctt tea tec aaa c-3 ' (SEQ ID NO:36) 



Example V Vector construction for complementation 

The Agrobacterium vacuum infiltration procedure was used to generate transgenic 
plants (Bechtold et al., 1993; Grant et al., 1995). Vectors were derived from pGPTV-Hyg 
(Becker et al., 1992) as follows: pSGCGF was made by restricting pGPTV-Hyg with 
HindlD and SacI and replacing this fragment with a Hindlll-SacI fragment containing the 
polylinker from pIC20H (GenBank accession L08912; provided by Steve Goff, Novartis, 
Research Triangle Park, N.C). Either the 7kb Xhol or 4.5 kb Pstl-Xhol genomic fragments 
were cloned into this, the former into the unique vector Sail site, the latter as a Sacl-Sall 
fragment derived from an intermediate cloning step into pBS as a Pstl-Xhol fragment. The 
pHyg35S vector was made by cloning a four enhancer-containing 35S promoter fragment as 
a Hindin-Xbal fragment into pGPTV-Hyg (provided by Dr. Douglas C. Boyes, Univ. of 
North Carolina, Chapel Hill). The EST 82D1 1 cDNA sequence was isolated as a Sall-Xbal 
fragment from pZLl (Newman et al., 1994) and cloned into Xhol-Xbal digested pHyg35S. 

Example VI Cloning 

The genomic Ws-0 library in 1GEM11 was a gift of Dr. Kenneth A. Feldmann 
(Univ. of Arizona). The cDNA library is an oligo-dT primed library prepared from polyA+ 
Col-0 mRNA from leaves cloned into 1ZAPII (Stratagene, La Jolla, CA) according to the 
manutacturer's instructions (gift of Dr. Douglas C. Boyes and Dr. Murray R. Grant). 
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Example VII LSD1 sequences 

The sequences of the LSD1 cDNA (SEQ ID NOS:14 and 15) and the 4.5 kb LSD1 
XhoI-PstI genomic fragment (SEQ ID NO:13; the longest 5'LSDl cDNA starts at base 
1892 of this sequence) are deposited in GenBank as accessions U 87833 and U 87834, 
respectively. Endpoints of the various LSD1 cDNAs isolated are shown in Table3A and 
examples are provided by SEQ ID NO: 14 (short form from cDNA MG7 as shown in Table 
3) and SEQ ID NO: 15 (long form, from cDNA MG8). The polypeptides deduced from 
these are shown in Fig. 11-12, respectively. Table 3B shows the sizes of each intron 
deduced from comparison of the sequence shown in SEQ ID NO: 13. 

Table 3. Sequence characteristics of the LSD 1 gene 
Endpoints of independent LSD1 cDNAs 



cDNA 


5' end point 


Alternate splice 


3' end point 


MG7(2) 


C 1 


short 


A 1021 


EST 82D11 


A 27 


short 


T 1031 


MG4 


C59 


short 


A 1188* 


MG10 


C59 


short 


G1225 


MG5 


G67 


short 


A 1205 


MG2(4) 


G90 


short 


A 1106 


MG8 (2) 


G98 


long 


A 1082 


MG16(2) 


C 103 


short 


A 1066 


MG11 


C 117 


long 


G1225 



Numbers in parentheses refer to the number of isolates of the same clone. Nucleotide 
numbers at the 5' and 3' ends refer to nucleotide positions from SEQ ID NO:l3. An A at 
the 3' endpoint can be either an A in the genomic sequence or the first A of the polyA tail. 
The endpoint marked with an * had no polyA tail. 



Intron sizes 
intron # 



size in nucleotides 
88 



2 (short splice) 68 
2 (long splice) 
3 



129 
89 
489 



4 



5 



100 



6 



92 
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20 



25 



35 



16 

87 



Intron splice junction positions are located at bses 198-199, 260-261, 447-448, 552-553, 
692-693, 764-765, and 836-837 in SEQ ID NO:13. 

Exam^eVm Genetic and physical mapping of bdl 

The Isdl mutation segregates as a monogenic recessive (Dietrich et al., 1994). F2 
progeny of a cross between Isdl (Ws-0 background) and Col-0 (LSD1) were analyzed using 
the co-dominant amplified polymorphic sequences (CAPS) mapping procedure (Konieczny 
and Ausubel, 1993) to first establish linkage to the AG marker on chromosome 4. The 
closely linked gl3838 probe (3 recombinants in 1632 meioses) was used to identify YAC 
(yeast artificial chromosome) clones (Schmidt et al., 1995; Schmidt et al., 1996). We 
constructed a physical contig of these YACs, shown in figure 1A. We used'labeled YAC 
ends CIC1H1L, yUPSFTR and EG20B4L to isolate genomic phage clones, subcloned 
fragments form each of these, end-sequenced the subclones, derived primer sequences and 
15 developed new CAPS markers (see Tables 1 and 2). The CAPS markers 1H1L-1.6 and 
5F7R-1.5 mapped closest to Isdl (1 and 3 recombinants, respectively from 2054 meioses); 
see Tables 1 and 2 for new CAPS markers). We hybridized these two CAPS markers to 
filters containing bacterial artificial chromosome (BAC) clone arrays (Choi et al., 1995, 
distributed by the Arabidopsis Biological Resource Center, Ohio St. Univ.), and isolated the 
five BAC clones depicted in Figure 2B. Because 5F7R-1.5 and 1H1L-1.6 genetically flank 
Isdl (Figure 1 B), BAC clone 1 G5 should contain the gene. 

As 1G5 was the only BAC clone to physically span the relevant genetic region, we 
connected BACs 6H3 and 8A6 by walking in a genomic phage library. We defined a 5kb 
Hindin fragment from BAC 8A6 which hybridized only to itself and BAC 1G5. When 
used as a probe on filters containing restriction digests of the relevant BAC clones, this 
fragment hybridized to a 1.3 kb EcoRI fragment which also was present only on BACs 8A6 
and 1G5. This 8A6-1.3 clone, (small box in Figure 1C) was used to isolate three phage 
clones, two of which are depicted in Figure 1C. Labeled inserts from each detected BAC 
clones 1G5, 6H3 and 8A6, thus providing multiple redundancy of genomic cloned DNA 
30 encompassing Isdl. We also converted 8A6-1. 3 into a CAPs marker, and found that it co- 
segregated with Isdl in 2054 meioses. This map resolution of approximately 0.05 map 
units, suggested that Isdl was within 5-15 kb (at 100-300 kb per map unit; Schmidt et al., 
1995; Schmidtetal., 1996) in either direction of 8A6-1.3. 

We probed genomic Arabidopsis DNA blots of digested wild type Ws-0 and Isdl to 
confirm co-linearity of the cloned and genomic DNA immediately surrounding 8A6-1.3. 
We noted that a variety of fragments detected a genomic DNA rearrangement in Isdl 
relative to wild type Ws-0 (data not shown). This rearrangement corresponded to a loss of 
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restriction sites and a deletion as noted in Figures 1C and 3C The Isdl mutant comes from 
an Agrobacterium mutagenized population of Arabidopsis, and it is known that the 
transformation procedure can generate non-T-DNA associated mutations (Feldmann, 1991). 
We subcloned and sequenced various wild type genomic DNA fragments at this position, 
5 and compared their sequences to several databases, including the Arabidopsis EST database 
(Rounsley et al., 1996, http://www.tigr.org/tdb/at/at.html). One EST clone (EST 82D1 1T7; 
GenBank accession T45220) exhibited blocks of identity to our genomic DNA sequence, 
suggesting the presence of introns in the latter. Because the gene encoding this EST is 
largely deleted in Isdl, it became a candidate LSD1 gene. 

10 

Example IX Complementation of Isdl 

To confirm that the genomic deletion encompasses LSD1, we constructed subclones 
from the genomic phage as shown in Figure 3C for complementation into the T-DNA 
binary vector pSGCGF. Because the typical method for generation of transgenic 

15 Arabidopsis, vacuum infiltration of Agrobacterium carrying binary T-DNA vectors, triggers 
the propagative cell death indicative of the Isdl phenotype, we devised an alternate 
complementation strategy. We transformed Fl plants of Isdl x Col-0, and plated surface- 
sterilized seeds of the next (F2) generation onto media containing hygromycin as a selective 
antibiotic. We then identified hygromycin resistant transformants which were homozygous 

20 for Ws-0 alleles at 5F7R-1.5, 1H1L-1.6, and 8A6-1.3, and thus were Isdl/lsdl homozygous 
mutants. These individuals contained both mutant and wild type alleles for the CAPS 
marker which spans the Isdl deletion, because a wild type allele is present on the transgene. 
These transgenic plants were treated with droplets of 2,6-dichloroisonicotinic acid (INA); 
0.3 mg/ml wettable powder containing 25% active ingredient, Uknes et al., 1993a) a potent 

25 inducer of SAR and the Isdl phenotype (Dietrich et al., 1994). If the mutation were 
complemented, then INA treatment should not lead to spreading cell death. Table 4 shows 
that transgenic plants carrying either the 7kb Xhol fragment or the 4.5 kb Pstl-Xhol (Figure 
3C) all survived this treatment, and are thus complemented for the Isdl mutation. Selfed F3 
progeny from a complemented F2 individual carrying either the 4.5 kb XhoI-PstI fragment 

30 or the 7 kb Xhol fragment were also analyzed. All F3 progeny which inherited the 
transgene were complemented (Table 4), while all of their non-transgenic sibs still exhibited 
the Isdl phenotype (data not shown). In no case did wild type control plants exhibit 
spreading cell death after INA application. 



35 



Table 4. Comp lementation of the Isdl mutant 

# of plants complemented/# transgenics tested from: 
Construct Independent F2s Transgenic F3 progeny 
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7kbXhol l/l A 20/20* 

3/3° 21/21° 

kb Pstl-Xhol 2/2 A 14/14 8 

35S-cDNA 1/1 A i9/i9 B 

T 



Selected for hygromycin resistance and screened for homozygous Ws-0 alleles through the 
Isdl genetic interval as described, except where noted in . Individual F 2 s were both drop 
tested with INA and shifted to LD conditions (Dietrich et al., 1994). 

Selfed progeny from a complemented F 2 individual (homozygous Ws-0 alleles through the 
Isdl interval) were screened by PCR at F 3 for presence of the hygromycin resistance gene 



and then INA tested 

c 



F 2 parents were identified as hygromycin resistant and heterozygous through the Isdl 

interval, then selfed and re-screened as hygromycin resistant and homozygous Ws-0 
through the Isdl interval at F before INA testing. 



Due to low numbers of independent F2 transformants which were homozygous 
mutant through the Isdl interval from the original transformation, we also isolated F2 
transformants carrying the 7 kb Xhol fragment which were originally identified as 
heterozygote at the CAPS markers flanking Isdl. Selfed progeny from these should 
segregate both the transgene and the Isdl mutation. Among these progeny, we identified F3 
individuals which were homozygous Ws-0 through the Isdl interval and carried the 
transgene. As shown in Table 4, these also were all complemented for protection against 
INA-induced spreading cell death. We conclude that the 4.5kb Pstl-Xhol fragment carries 
the Isdl gene and sufficient cis control elements to ensure its expression. 

All transgenic plants complemented for the INA-induced Isdl mutant phenotype 
were also complemented for initiation of spreading cell death after transfer to non- 
permissive long day conditions as well (Dietrich et al., 1994; not shown). Thus, the 
complementing DNA corrects the mutant phenotype induced by two independent stimuli. 

Example X Identification of alternately spliced LSD1 transcripts 

We sequenced all of the complementing 4.5 kb Pstl-Xhol genomic DNA fragment 
(SEQ ID NO:13), eight independent cDNAs (Example VII) and completed the sequence of 
the full 82D1 1T7 EST sequence. Among the cDNAs, we identified two classes expressing 
open reading frames of either 184 or 189 amino acids (SEQ ID NO:16 and 17). An 
alternate splice which adds 61bp to the 5' region of some cDNAs also provides an alternate 
translation start, hence, the extra five amino acids in SEQ ID NO:17. The sequences of both 
cDNA classes matched exactly the genomic sequence except at the positions of 7 introns 
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(see Table 3). Nucleotide 1 of the longest cDNA is at position 1 892 in the 4.5 kb Pstl-Xhol 
genomic sequence (SEQ ID NO:13 ). Thus, 1891 nucleotides of promoter are sufficient for 
appropriate expression in complementation of the Isdl mutation. The cDNA 5' ends are 
clustered (Table 3), suggesting that the longest could be full length. We also complemented 

5 the Isdl mutation by transformation of the full insert from EST clone 82D1 1T7 expressed 
from the strong and constitutive cauliflower mosaic virus 35S promoter (see Table 3) 
proving that this cDNA contains the entire LSD1 coding region. The 3' ends of these 
cDNAs are very heterogeneous, suggesting the presence of multiple polyadenylation 
addition signals (Table 3). No other significant open reading frames were observed in the 

10 4.5 kb Pstl-Xhol genomic clone. 

When either the EST 82D11T7 clone, or a 0.8 kb EcoRI-Xbal genomic fragment 
covering the Isdl deletion were used as probes on RNA blots, a rare mRNA of 
approximately 1.2 kb was detected in leaf tissue of wild type Ws-0 plants (Figure 3). This 
length is consistent with the size of the longest cDNA, supporting the conclusion that we 

15 have identified a nearly full-length transcript. Importantly, this mRNA was completely 
lacking in mRNA prepared from Isdl leaves, furthering the argument that it encodes LSD1 . 
The finding that Isdl is an mRNA allele was corroborated by sequencing across the 
genomic deletion in the mutant (Figure 3). The 5' border of the deletion is an A at 
nucleotide 55 and the 3' boundary is in the fourth intron (data not shown). It is noteworthy 

20 that expression of this candidate mRNA was unaffected by application of INA (Figure 4, 
top). The expected high level of INA-induced PR-1 mRNA accumulation in leaves of both 
wild type and Isdl (Figure 4, middle) served as a control in this experiment for efficacy of 
INA treatment. 

The Isdl phenotype can be observed in all cell types examined after initiation of 
25 lesion formation (Dietrich et al., 1 994). RNA blot analysis of seedlings, stems, leaves and 
flowers demonstrated that the LSD1 gene is expressed constitutively in each of these 
Arabidopsis tissues (data not shown). Thus, the requirement for LSD1 activity in these 
tissues is consistent with the gene's expression pattern. 

30 Example XI The LSD1 mRNA encodes a novel zinc-finger domain 

We searched a variety of databases with the predicted translation product of the 
LSD1 cDNA sequence. Several striking features emerged. First, there are three zinc-finger 
domains, depicted in Figure 5 (SEQ ID NOS:l-3), which share remarkable homology with 
one another. These are C-x-x-C, or type IV, zinc-fingers, according to the classification of 
35 Sanchez-Garia and Rabbitts (1994), and they share most homology with plant relatives of 
the GATA-1 transcription factor (Evans and Felsenfeld, 1989; Omichinski et al., 1993). 
The plant members of this sub-family described to date include the CO gene, which controls 
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transition to flowering (Putterill et al., 1995), a set of related DNA binding proteins 
(Yanagisawa, 1995; De Paolis et al., 1996) and a gene whose transcription is salt stress- 
induced (Lippuner et al., 1996). None of these proteins shares with LSD1 the consensus 
homology within the Zn-fingers. The second homology domain is derived from the carboxyl 
portion of LSD1, from residues 129 to 180 (Figure 6-SEQ ID NO:4). This region of LSD1 
exhibits homology to three broad classes of regulatory proteins. First, all mammalian 
insulin receptor substrates; second, a set of animal transcription factors; and third, a maize 
transcription initiator binding protein. 

The conceptual LSD1 translation product also identified two additional Arabidopsis 
ESTs via their predicted amino acid homology. Importantly, each has at least one C-x-x-C 
Zn-finger and most of the associated consensus residues found in the LSD1 internal 
homologies. They are ESTs 172A7T7 (GenBank R6552)(SEQ ID NO: 58 and 132J2IT7 
(GenBank T45809). Thus, it is probable that LSD1 is the first member of a widely 
distributed Zn-finger sub-family in plants, defined by the internal homology within each 
zinc-finger. The other amino acids in the consensus section are not known to be found in 
any other zinc finger proteins. 

Example XII Identification of expressed target sequence tags (EST) and cDNAs 
containing LSD 1 -ty pe zinc finger domains 

As discussed in the text prior to the Examples, the predicted amino acid sequence of 
the LSD1 zinc fingers was used to search the GenBank database (NCBI). Two Arabidopsis 
thaliana ESTs (EST132J21T7 and EST 172A7T7) were identified, each of which contains 
at least two zinc finger domains and most of the associated consensus residues found in the 
LSD1 internal homologies (Dietrich, 1997). These ESTs were ordered from Ohio State 
University Arabidopsis Biological Resource Stock Center and resequenced. Sequences 
were analyzed with the Genetics Computer Group programs (Devereaux et al 1994) A 
specific probe isolated from EST172A7T7 was subsequently used for screening' of cDNA 
and genomic libraries. The bacterial strain carrying EST132J21T7, however, was not 
viable. Therefore, degenerated primers were designed based on the EST132J21T7 
sequence. Genomic Arabidopsis thaliana Ws-0 DNA was used in the PCR reaction and 
gave rise to a specific PCR product of approximately 400 bp. This fragment was subcloned 
via the TA Cloning Kit (Invitrogen, Carlsbad, CA) into pBluescript KS(+). Two new genes 
were identified as described here. Their predicted protein products are highly related to that 
of LSD1 indicating an involvement in the control of cell death in plants 



Example XTTT LOL1 cDNA 

Poly A + RNA isolated from uninduced and P. syringae DC3000 induced 
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Arabidopsis thaliana Col-0 leaf tissue was reverse transcribed. The resulting cDNA 
population was subcloned unidirectionally into the EcoRI/Xhol - sites of a lambda-Zap II 
vector using the cDNA-synthesis Kit (Stratagene, La Jolla, CA) according to the 
manufacturer's directions. The titer of this MG-library was calculated as 2.5 x 10 6 pfu 
Approximately 8x10 pfu of the amplified MG-library were subsequently screened with a 
P dCTP labeled probes (Stratagene 'Prime it' Kit) specific for EST132J21T7 or 
EST172A7T7. With the probe specific for EST132J21T7, four cDNA clones were 
identified and subcloned via the Stratagene excision system. One clone contained an insert 
of less than 100 bp in length and was not further analyzed. The three rernaining clones were 
sequenced by standard protocol (primers: M13F, M13R, PE6, and PE7); for primer 
sequences refer to Table 5, below). Clones 2 and 3 contained identical open reading frames 
(ORFs) and were homologous to EST132J21T7 and to another identical and overlapping 
EST clone, EST119C9T7. The fourth clone consisted of a chimeric cDNA of 
approximately 1500 bp, with approximately 400 bp similarity to EST132J21T7, 
EST1 19C9T7, and clones 2 and 3. It was also not analyzed further. 

Table 5. Primers and primer sequences used 



Primer 


Primer Sequence 


SEO ID NO 


M13F 


5'- 


GTA AAA CGA CGG CCA TG -3 ' 


37 


M13R 


5'- 


GGA AAC AGC TAT GAC CAT G -3' 


38 


PE6 


5'- 


TTC ATG GCA ATG GTG TGA CCC C -3' 


39 


PE7 


5'- 


CTG CCG GAT TCT TGA TCG AAG A -3' 


40 


PE8 


5'- 


AGA GGA AGG TCC GCC TCC GG -3* 


41 


PE9 


5'- 


CTC TGC TCT CCT GAG ACT GCT T -3' 


42 


PE13 


5'- 


CAT CAT AAT GTC TCC TTT TGA GAC -3' 


43 


PE15 


5V 


GCC ATC CAT TAT TCA TCG CCT -3 ' 


44 


PE23 


5'- 


GAG GAG GAA GAA CTG CAG ATT CC -3' 


45 


PE30 


5'- 


GTG CTC CAT GTC CAA ATC ATA C -3' 


46 



Clone 2, with an insert length of 908 bp represents a full length cDNA clone, as 
determined by the presence of an open reading frame flanked by untranslated sequences, 
and was renamed LOL1 (Lsd one /ike)(SEQ ID NO:47). We confirmed that the LOU 
cDNA and EST132J21T7 are encoded by the same gene using genomic DNA (Southern) 
blot analysis (data not shown). The LOL1 protein of 154 amino acids (SEQ ID NO:48) 
contains three zinc finger domains of the LSDl-type (SEQ ID NOS:49-51). The consensus 
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sequence of the LOL1 zinc finger domains is defined by CxxCxxLLMYxxGAxSxCxxC 
- (SEQIDNO:53). 

Example XIV LOL2 cDNA 
5 By screening the MG-cDNA-library, no clones homologous to EST172A7T7 could 

be obtained. Therefore, the AB-cDNA-library (derived from RNA isolated from different 
tissues of sterile grown plants, available at the European Arabidopsis Stock Center, 
Cologne, Germany) was screened with a PdCTP labeled probe specific for EST 1 72 A7T7. 
Six homologous cDNA clones were obtained and subcloned into the Smal site of 
1 0 pBluescript KS(+). Restriction analysis indicated that the inserts were encoded by the same 
gene. Only the longest insert was sequenced following standard protocol (primers used: 
M13F, M13R, PE8 and PE9: for primer sequences, refer to Table 5. We demonstrated that 
this insert contained an ORF of 500 bp homologous to EST172A7T7. This non-full length 
cDNA was designated LOL2 (SEQ ID NO:54). The deduced protein (SEQ ID NO:55) 
15 consisting of two LSDl-type zinc finger domains extending from bases 130-195 and 244- 
309 of SEQ ID NO:54 (SEQ ID NOS:56-57, respectively). Comparison to EST172A7T7 
shows that the EST (SEQ ID NO:58) contains a 124 bp insertion (bases 386-509 after the 
second zinc finger of SEQ ID NO:58), leading to a different C-terminal. Comparison of 
these two partial cDNA sequences with the genomic LOL2 sequence (see below) 
20 demonstrates that they are alternate splice forms from the same gene encoding two related 
proteins. This conclusion is strengthened by the fact that the LOL2 cDNA and 
EST172A7T7 hybridize to the same genomic DNA fragment and therefore are encoded by 
the same gene (data not shown). Thus, sequence analysis of genomic LOL2 clones shows 
that the non-identical C-tennini of LOL2 and EST172A7T7 are due to alternative splice 
sites. The genomic sequence of LOL2 (SEQ ID NO:59, has a putative TATA-box sequence 
and polyadenylation signal (bases 922-930 and 2539-2544), and the exon borders of an 
alternative splice site (bases 2256-2382). The derived amino acid sequence extends from 
bases 1231-2462. 



25 



30 



35 



Example XV Isolation of genomic LOL2 sequences from an Arabidopsis thaliana Col-0 
library 

8 x 10 genomic lambda clones (lambda GEM11, European Arabidopsis Stock, 
Center) were screen with a a P dCTP labeled probe specific for EST172A7T7. Nine 
clones homologous to LOL2 EST172A7T7 could be identified. Restriction analysis 
demonstrated that the nine clones belonged to five different classes. Inserts ranging from 
two to five kb in size were isolated and subcloned into either SacI ore BamHI sites of 
pBluescript KS(+). Sequence information derives from two overlapping clones, 
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sequentially sequenced with primers M13R, PE9, PE13, PE15, PE23 and PE30 (see Table 
' 5). 

The genomic LOL2 sequence has a length of 3060 bp. Promoter and 5' untranslated 
regions consist of approximately 1200 bp. The translation products are encoded by three 
5 exons, which are interrupted by two introns of 182 bp and 458 bp length, respectively. The 
overall length of the coding sequence is 1232 bp. Due to alternative splice sites, two 
proteins which differ in their C-terminal regions are encoded by the LOL2 gene (SEQ ID 
NO:59). A first protein, of 155 amino acids (SEQ ID NO:60), is identical to the LOL2 
cDNA and contains two zinc finger domains of the LSDl-type. The other translation 
10 product corresponds to EST172A7T7, consists of 147 amino acids, and contains two and a 
half zinc finger domains (SEQ ID NO:61). The consensus sequence of the two zinc finger 
domains of LOL2 is CxxCxxLLxYxxGxxxVxCSSC (SEQ ID NO:62). 

Example XVI Obtaining interacting genes 

15 

The methodology for this Example is known to those skilled in the art and 
summarized in Fields and Sternglanz, 1994, and Finley and Brent, 1996. The LSD] short 
or LDS1 long open reading frames; were cloned into the "bait vector" pEG202 of the 
commonly available LexA yeast two-hybrid system (Matchmaker , Clonetech, Palo Alto, 

20 CA) to generate plasmids pEG202-L and pEG202-S. These encode fusion proteins of the 
LexA DNA binding domain and the full length LSD1 protein of both long and short 
isoforms (SEQ ID NOS 14 and 15). Yeast strain EGY48 is transformed with this plasmid, 
and appropriate controls performed to ascertain the LSD1 fusion protein encoded by 
plasmids pEG202-L and pEG202-S did not intrinsically activate expression of the yeast 

25 markers used in this system. A yeast gene expression library was constructed in plasmid 
pJG4-5 using RNA from Arabidopsis leaves infected with Pseudomonas syringae. This 
library encodes fusion proteins of expressed Arabidopsis genes and the B42 transcriptional 
activation domain. The library was transformed en masse into the yeast strain EGY48 
carrying either plasmid pEG202-L or -S. From an equivalent of 6 million clones screened, 

30 122 were isolated. The longest insert of a member from each of these classes was 
sequenced using standard DNA sequencing methods. Because the novel Arabidopsis gene 
so identified is produced as an active translation fusion in this system, one is immediately 
able to identify the deduced protein sequence. The most interesting sequences thus defined, 
and their deduced protein sequences, are set forth herein as SEQ ID NOS: 66-91. 

35 The first main class of LSD 1 -interacting proteins has no database homologues. 

These proteins encode putative "sequestration" proteins for LSD1 whose function is to 
inhibit LSD1 function until the correct pathogen signal is received. Their utility lies in 
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manipulation of the interaction with LSD1 in plant cells such that LSD1 is altered in its 
ability to regulate the response to pathogen. Alternatively, these novel LSD1 -interacting 
proteins may encode new components of the gene regulation machinery working together 
with LSD1 to control transcription in response to pathogen infection. These proteins are 
valuable because of the knowledge that LSD 1 is a key regulator of cell death in plants in 
response to pathogens. Proteins which physically interact with LSD1 share in this cellular 
function. 

The second class defines proteins having database homologies to other proteins 
strongly suggesting a role in control of gene transcription (e.g., CAAT box binding proteins' 
which are known to bind the common CAAT regulatory unit in DNA preceding nearly all 
genes encoding eukaryotic mRNA). This finding is completely consistent with the 
embodiment described above, in which the LSD1 partner proteins identify other 
components of the gene regulatory machinery required for response to pathogens 
Manipulation of the expression of, for example, CAAT box binding proteins, will result in 
altered response to pathogen infection. 

While the invention has been described with reference to specific embodiments, it 
will be appreciated that numerous variations, modifications, and embodiments are possible, 
and accordingly , all such variations, modifications, and embodiments are to be regarded u 
being within the spirit and scope of the invention. 
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SEQUENCE LISTING 

SEQ ID NO:l 

LVCHGCRNLLMYPRGASNVRCALCNTINMV 



SEQIDNO:2 

1 1 CGGCRTMLMYTRGAS S VRCS CCQTTNLV 



SEQIDNO:3 

INCGHCRTTLMYPYGAS S VKCAVCQFVTNV 

SEQIDNO:4 

MSNGRV- PLPTNRP - NGTACPPST - STSTPPSQTQTVWENPMSVDBSGKLVSNV 



SEQ ID NO:5 

MSPG- VAPVPSNRKGNGDYMPMS PKSVSAP - QQI INPIRRHPQRVDPNGYMM 

SEQIDNO:6 

VPLPTNRP-NGTACPPSTSTSTPPSQTQTWVENPMSVDESGKLVSNV 



SEQIDNO:7 

VPLPANNPW- TTWPS TPPSQPPAVCPPW- 



SEQ ID NO:8 

VPLPANNPW- TTWPS TPPSQPPAVCPPW 



SEQIDNO:9 

IPVYTNSNV-GTALPPSVSPSVSPSVT 



SEQ ID NO:10 

WLP-NAAPAGAAAPPSGSRSTSPS 



SEQIDNO:ll 

SNGRVPLPTNRPN-GTACPPSTSTSTPPSQTQTVWENPMSVDESGKLVSNV 



SEQ ID NO:12 

SRALVPVPAADPNAG-AIVPANKSKRSPEQGQRRIRR 
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SEQ ID NO:13 



10 30 50 

GATCAAATCTAGTTACGCTTAAATTTGGATATATCTAAGGTTTCTTCGTCAATATATGGA 

70 90 110 

G CTTACG AAAACGAAAGAGTGAGCTACGAGGAAC TAAATCAATGAAGATAAGAGGAATGA 

130 150 17Q 

AGGAGAGAAGATCACCAAGGTGTAGAAATTTCTGAAGTCGTCTCCTCCAATCTCCACTAT 

190 210 230 

TGGTTTGTTCAGAACTTGAGAAGGCCTTAGATCCAAGCCATTAGTAACCTCTCTATGGCC 

250 270 290 

ATAAGTGACCTTAAGAGAGACCAACCTCGTGAAAGGATCAAGAACATCTCCAACAACACT 

310 330 350 

GCCGACCACGAGAGGATCTCTACGACTTAAAGACATATTTATCTTGGATCTCAAGTATCT 

370 390 410 

CAATAAAATGTTTTGCTTCTAACCTTATGAACCCTTACTTGCTATTCTTTATATAACGTT 

430 450 470 

TTGGGAATTGCAATAATTAGCTATTTAGCTTTATTCTCTCCAATGAAATCATTACCAGGG 

490 510 530 

TCTTTTCGTGTATAGTTATCnTCGAGAATCTACAACTCGTTC^^CGTACGTATATCACrT 

550 570 590 

ATAATTCATGTTTTTrrTTCTTTCCTTTTTTCTAAATTTATAGTATTCTTATTCCAAAAC 

610 630 6 5o 

CCACCAGTATAAAACAGAAATAATCATArrCCAAATTATACATCATCCACraGTTTCTTG 

670 690 710 

CTAGCCACTAGTATGTAATTTATTCTGACTTATCATTGGAACTTCATGAACTATTTAAAA 

730 750 770 

TAATGTCACAAGCATATAATATGCTGCATATTTGCGTACGTCACGCATTTTGCGTCACAT 

790 810 830 

GTCACTCATTTAAATAGTTAAGGACACTACATTACACCGATTATGTATGATGTTAATGCA 

850 870 ego 

TTTTAGAATAACTCCTTCAACCTAAACCATCATATAAAAGTATATATGCTCCAGATAAAT 

910 930 950 

TGACGCCATAATTGTTCACATATCTGGTTGGTTTGTACATACGTACTAGACTCTTTTTTT 

970 990 1010 

TCTTTTCTTAATGTAGTACTAAACTTAATTAATACCATCAAAAATATCAATTTAACAAAA 



1030 10 50 



1070 
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CAAACCAGTAAAACTTTTAAAAGAATC 

1090 1110 1130 

TAG ACACAAGGT AACAGAAGT ATAATAACGAC AG AAAAATGAACT^ATTGG C C AAAAAATT 

1150 1170 1190 

CGTTTTCAAACGTGATTTCAAAATTGTCTCCAAATCTTAAATGTTGATAAAGTAATTTTT 

1210 1230 1250 

TTTTAAATTCATTATACCTTTCAAAAACAAGTGT^ 

1270 1290 1310 

CTTACACTCCAAACAAATTTAGTTCCCCAAGTTTGGAAGACT^AAAATTTCTAA 

1330 1350 1370 

CTGACAAAACACATGAGAAATAAACCGATAAAGACTTCTAAAAACTATTGCAGACCAGTT 

1390 1410 1430 

TC^TTTGCTGACCACAAAAAGTCATGAGAATACAATTAGCTCAGTGATTCTTGATATT^ 

1450 1470 1490 

TGGTACCTAACAAAAAGAAAAGTGTGTGAGGTTAGATGGCTATGATTTTTGCTCTCCAAT 

1510 1530 1550 

TTATTGTCCATTTCCCCAATTTGTAATATGAAATGCGCAAATTACTCTTCTTCCC^ 

1570 1590 1610 

AATAAGCAAACGAAAACATACGTGGGACGTTATGTTGAGAACATTTGATTAAAGTTTATA 

1630 1650 1670 

TGCGATTTTCATTTATTTACTATGAATTTOTTC 

1690 1710 1730 

AAC ACACAAATATTATAGAATTTTCATTGGTT CAAAGGGG T AG ACAAAAAATAATTTAAT 

1750 1770 1790 

ATTATTACACCATTTGCAGAAAATTAGAAAATATATTTTTACCC^ 

1810 1830 1850 

TGGACGTATGCTTGGCATAAAAATTCATATTTAATTAGCAGAAGCCAATCGCTGCGTTTG 

1870 1890 1910 

TATATACGCGTTTATGACCGAGAAAAAAACCCTTACGCGTCATGTAAAAAAAAAAGAAGC 

1930 1950 1970 

GTAAATTACGAAAAAC^GAGAGATAAATCCGGGC^TTGAGATTTTGGAGATAGAGAGAGA 

1590 2010 2030 

GAAAAATCGAAATCTATTGTCTATCTCCTCAATTTC 

2050 2070 2090 

CTCTAGATTTCGCGGGTTTTGGATTCGATTCCTTACCCTTCTCCAATCGGTAAGA^ 



2110 2130 2150 

CTCCAAAGTTTGTTCCTTTTTTTCAATTTTCGCCAATTCTC 
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2170 2190 2210 

GTTTGATTTGGATGCAGAAGTTTTTC 

2230 2250 2270 

AGCTCTTTTTGTTAATCAGGTGAGTTTTTAGGTA^ 

2290 2310 2330 

AATGACTAAGTATTGTGAAATGTTTAGGGTTTCATCTGTGTGGGTCTTGTTTTGAAGC^ 

2350 2370 2390 

TTTGTGTGTGTTTGGATGAAAGTAGCAGATATGCAGGACCAGCTGGTGTGTCATGGTTGT 

2410 2430 2450 

AGGAATTTATTGATGTATCCTAGAGGAGCATCTAATGTGCGTTGTGCGTTATGTAACACT 

2470 2490 2510 

ATCAACATGGTTCCTCCTCCTCCTCCACCT(^CGGTATCGATTTCTTTGTTGAATTTG^ 

2530 2550 2570 

TTGAGGATGAGGTTAATATGCTCTGCAATTGTATTATAACTTGGGTTCTGATTCTGT^ATA 

2590 2610 2630 

CAGACATGGCACACATTATATGTGGTGGTTGTAGAACAATGCTTATGTATACGCGTGGGG 

2650 2670 2690 

CTAGTAGCGTAAGATGCTCTTGCTGTCAAACTACGAACCTTGTGCCAGGTATATTAATAA 

2710 2730 2750 

TATCGTGACATC CATATCAATCCTTTTAAAGACCATGTATTATATTG CTTTATAAGGTCT 

2770 2790 2810 

TTTAGTCCTTTAGAATCTTCTTTCACACTTTTGTTTGATAACATTGTTCTGTGGAGATGA 

2830 2850 2870 

TGCTTAGGTAACGTATTTCCACITTTCCCAAAGATGTATATGAATCTGAATTCTGAAAAT 

2890 2910 2930 

ATCTGGGATTTGTAAAGCAGCTGAAAGTACTTAAAACAAAGCTTTT 

2950 2970 2990 

GACTAGGTAACTACTTGTTAGAGCTAGTAGGGTTTATTATTGTTTTGTTTGATCTACCAT 

3010 3030 3050 

TAGATTCTTATCTTTAATTAGCGTCTAAGCTGTTGTCATTTAGCTGTATGATTATCATTT 

3070 3090 3110 

ATC CATGACTGCTTAAGAACATTG CTGATTACTTCGTTCATTAGTATTTCTTGGATTTTT 

3130 3150 3170 

CTAGCATTAACATTGCTTGTTTTCTG7UVTCTGTGCGTGTCTTTTTTGAAATCGACAGCGC 



3190 3210 3230 

ACTCCAATCAGGTTGCCCATGCTCCTTCCAGTCAGGTTGCGCA7UVTCAATTGTGGGCATT 
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' 3250 3270 3290 

GTCGGACGACCCTCATGTATCCTTACGGTGCATCATCCGTC^AATGCGCTGTTTGTCAAT 

3310 3330 3350 

TCGTAACTAACGTTAATGTGATTATTCCTATCTATTJUIGCCACCTCTGCATGGTTGAGTT 

3370 3390 3410 

AAGTATAGAGATCTTTCTGTTGGAAATTTTCATTC 

3430 3450 3470 

AGCAATGGAAGGGGTACCTCTCCCAACTAACCGGGCAAATGGAACAGCTTGTCCCCCCTC 

3490 3510 3530 

TAC^TCAACTGTGAGTTATCAAATTATGAATTTGTAATAGTTCTGTATATTCTTATGGAA 

3550 3570 3590 

CTGGTACTTACTCTGTTCATCGATTTTTCATTTTACCT^ 

3610 3630 3650 

ACCCAAACCGTTGTTGTAGAAAACCCCATGTCCGTTGATGAAAGCGGAAAGTTGGTGAGT 

3670 3690 3710 

ATTTCTATCACCTGTGTTCTTCTTCTTATTTACCACATTAGAGGAAGATATGACAAAGTG 

3730 3750 3770 

ACTGAAACACACAAATTGCAGGTGAGCAATGTTGTTGTTGGAGTGACAACTGACAAAAAG 

3790 3810 3830 

TAATGAAGAATGAGTGAGATCTTAAAGATCAAATCCAAATTCTTCCTCTATTCCTGCGTT 

3850 3870 3890 

TGGTTTGTGCATATTACATACGCGGAAAAACTGTATGTTATATATCTCTTGACTCCTTTT 

3910 3930 3950 

TAACCCAAGAGAAAAAGCTTATCAGAATCTCTTGTTACTGCATTATTGGGGTTTATTCAA 

3970 3990 4010 

AGTTGAAGACA.CAAGGTTTTTGCTCGAATAATTTGGCATTCT 

4030 4050 4070 

ACCTTCTCTTCTGTTAGTTGACTTCTAAAACTCCATCGGCCCTTGTGGCATTGTTAATGT 

4090 4110 4130 

ATGTATGAATATAATCTGATACACCAACCAATCATTAAGATTTGGGTT^ 

4150 4170 4190 

CTTCCGTGGATGAGATATGCTACATGTCACAAGAACTGGTCTTAGCTTTGGTAGATAAGA 

4210 4230 4250 

CTTGTCTTAGAAGCAAGTCTTGAAATCTGGAAATCTATTTTC 

4270 4290 4310 

7VACCATAACCTAATCAGTCAGTACCCTCCAAGAAACATTAAAGTTAGATGATCCGACAAA 



4330 



4350 



4370 
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ACCTCTCAACAAAACCAACTC TTTCCATATAAATACTCTTTAACACTGGAC CAAATTTNC 

4390 4410 4430 

ACCCTTCCTCTTGATCCTCCCTGCATCACAAT 

4450 4470 4490 

TGGGTACCACAAAGAGCTGGAAACTACTCTTGGGGCTGAGAATATTTGCATTCATGGCT^ 

4510 
CTTTAGCTGCAG 



SEQ ID NO:14 

10 30 50 

CTTACGCGTCATGTAAAAAAAAAAGAAGCGTAAATTACGAAAAACAGAGAGATAAATCCG 

70 90 110 

GGCATTGAGATTTTGGAGATAGAGAGAGAGAAAAATCGAAATCTATTGTCTATCTCCTCA 

130 150 170 

ATTTGGATTGGATTTTCTGCATATCATCGCTCTAGATTTCGCGGGTTTTGGATTCGATTC 

190 210 230 

CTTACCCTTCTCCZAATCGAAGTTTTTGGCTTTGAATTGGATTTGGG 

250 270 290 

CAG CTCTTTTTGTTAAT CAGATATG CAGGACCAGCTGGTGTGTCATGGTTGTAGGAATTT 

310 330 350 

ATTGATGTATCCTAGAGGAGC^TCTAATGTGCGTTGTGCGTTATGTAACACTATCAACAT 

370 390 410 

GGTTCCTCCTCCTCCTCCACCTCACGACATGGCACACATTATATGTGGTGGTTGTAGAAC 

430 450 470 

GATGCTTATGTATACGCGTGGGGCTAGTAGCGTAAGATGTTCTTGCTGTCAAACTACGAA 

490 510 530 

CCTTGTGCCAGCGCACTCCAATCAGGTTGCCCATGCTCCTTCCAG 

550 570 590 

GAATTGTGGGCATTGTCGGACGACCCTCATGT^ 

610 630 650 

CGCTGTTTGTCAATTCGTAACTAACGTTAATATGAGCAATGGAAGGGTACCT 

670 690 710 

TAACCGGCCAAATGGAACAGCTTGTCCCCCCTCTACATCAACTTCAA 

730 750 770 

GACCCAAACCGTTGTTGTAGAAAACCCCATGTCCGTTGATGAAAGCGGAAAGTTGGTGAG 

790 810 830 

CAATGTTGTTGTTGGAGTGACAACTGACAAAAAGTAATCA 
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850 870 890 

GATCAAATCCAAATTCTTCCTCTGTTCC^ 

910 930 950 

AAAACTGTATGTTATATATCTCTTGACTCCTTTTTAACCCAAGAGAAAAAGCTTATCAGA 

970 

AAAAAAAAAAAAAAAAA 



SEQIDNO:15 

10 30 50 

GAAATCTATTGTCTATCTCCTCAATTTGGAT^ 

70 90 no 

TTCGCGGGTTTTGGATTCGATTCCTTACCCTTCTCCAATCGAAGTTTTTGG 

130 150 170 

GGATTTGGGTTTCGTTCCAAAATCAGCTCTTTTTG 

190 210 230 

TCTTGTTTTGAAGCAATTTGTGTGTGTTTGGATGAAAGTAGCAGATATGCAGGACCAGCT 

250 270 290 

GGTGTGTCATGGTTGTAGGAATTTATTGATGTATCCTAGAGGAGCATCTAATGTGCGTTG 

310 330 350 

TGCGTTATGTAACACTATCAACATGGTTCCTCCTCCTCCTCCACCTCACGACATGGCACA 

370 390 410 

CATTATATGTGGTGGTTGTAGAACZAATGCTTATGTATACGCGTGGGGCTAGTAGCGTAAG 

430 450 470 

ATGCTCTTGCTGTCAAACTACGAACCTTGTGCC^ 

490 510 530 

TCCTTCCAGTCAGGTTGCGCAGATC^TTGTGGGCATTGTCGGACGACCCTCATGTATCC 

550 570 590 

TTACGGTGCATCATCCGTCAAATGCGCTGTTTGTCAATTCGTAACTAACGTTAATATGAG 

610 630 650 

CAATGGAAGGGTACCTCTCCCAACTAACCGGCCAAATGGAACAGCTTGTCCCCCCTCTAC 

670 690 710 

ATCAACTTCAACACCACCCTCTCAGACCCAAACC 

730 750 770 

TGATGAAAGCGGAAAGTTGGTGAGCAATGTTGTTGTTGGAGTGACAACTGACAAAAAGTA 

790 810 830 

ATCAAGAATGAGTGAGATCTTAAAGATCAAATCCAAATTCITCCT 



850 870 890 

GTTTGTGCATATTACATACGCGGAAAAACTGTATGTTATATATCTCTTGACTCCTTTTTA 
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910 930 950 

ACCCAAGAGAAAAAGCTTATCAGAATCTCTTGTTACTGCATTATTGGGGTTTATTCAAAG 

970 990 
TTGAAGACACAAGGTTTTTGCT CGAAAAAAAAAAAAAAAAAAAAAA 



SEQroNO:16 



MetGlnAspGlnLeuValCysHisGlyCysArgAsnLeuLeuMetTyrProArgGlyAla 

10 20 

SerAsnValArgCysAlaLeuCysAsnThr 1 1 eAsnMe tVal ProProProProProPro 

30 40 

HisAspMet^aHisIlelleC^sGlyGlyCysArgThrMetLeiiMetTyrThrArgGly 

50 60 

AlaSerSerValArgCysSerCysCysGlnThrThrAsnLeuValProAlaHisSerAsn 

70 80 

GlnValAlaHisAlaProSerSerGlnValAlaGlnlleTlsnCysGlyHisCysArgThr 

90 10 0 

ThrLeuMetTyrProTyrGlyAlaSerSerValLysCysAlaValCysGlnPheValThr 

HO 120 

AsnValAsnMetSerAsnGlyArgValProLeuProThrAsnArgProAsnGlyThrAla 

13 0 140 

CysProProSerThrSerThrSerThrProProSerGlnThrGlnThrValValValGlu 

150 i 6 o 

AsnProMetSerValAspGluSerGlyLysLeuValSerAsnValValValGlyValThr 

170 leo 

ThrAspLyaLys 



SEQIDNO:17 



MetLysValAlaAspMetGlnAspGlnLeuValCysHisGlyCysArgAsnLeuLeuMet 

10 20 

TyxProArgGlyAlaSerAsnValArgCysAlal^ 

30 40 

ProProProProProHisAspMetAlaHisIlelleCysGlyGlyCysArgThrMetLeu 

50 60 

MetTyrThrArgGlyAlaSerSerValArgCysSerCysCysGlnThrThrAsnLeuVal 
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70 80 

ProAlaHisSerAsnGlnValAlaHisAlaProSerSerGlnValAlaGlnlleAsnCys 

90 100 

GlyHisCysArgThrThrLeuMetTyrProTyrGlyAlaSerSerValLysC^sAlaVal 

110 120 

CysGlnPheValThrAsnValAsnMetSerAsnGlyArgValProLeuProThrAsnArg 

130 140 

ProAsnGlyThrAlaCysProProSerThrSerThrSerThrProProSerGlnThrGln 

150 160 

ThrValValValGlxiAsnProMetSerValAspGluSerGlyLysLeuValSerAsnVal 

170 180 

ValValGlyValThrThrAspLysLys 



SEQmNO:18 

5'-CAG TGG ATC TTT CCT GAG ACG-3' 
SEQIDNO:19 

5'-CAT CTT CTT CTG CAA TCT GGG-3' 
SEQ ID NO:20 

5'-CAT CCA TCA AAC AAA CTC C-3' 
SEQIDNO:21 

5'-TGT TTC AGA GTA GCC AAT TC-3' 
SEQ ID NO:22 

5'-CAC GTT AGT TAG TTA GAA GG-3 5 
SEQIDNO:23 

5'-CTG ATG TTC TCT ACA AAT GG-3' 
SEQ ID NO:24 



WO 98737755 

10 



5'-CGT ATC CGC ATT TCT TCA CTG C-3' 
SEQ ID NO:25 

5'-CAT CTG CAA CAT CTT CCC CAG-3' 
SEQIDNO:26 

5'-TTG AGT CCT TCT TGT CTG-3' 
SEQ ID NO:27 

5'-CTA GAG CTT GAA AGT TGA TG-3' 
SEQIDNO:28 

5'-GAA TGG TGT AAC CAA ACT C-3' 
SEQ ID NO:29 

5'-CAT ACC GTA TGA TGG AAC-3' 
SEQ ID NO:30 

5'-GAA CTC ATT GTA TGG ACC-3' 
SEQ ID NO:31 

5'-CTA AGA TGG GAA TGT TGG-3' 
SEQIDNO:32 

5'-CCA AGA AGA GAA AAC GGA GA-3' 
SEQIDNO:33 

5'-AAC AAT AGG AGG TGC AGA GT-3' 
SEQIDNO:34 

5'-ACC TAA CAA AAA GAA AAG TGT GTG AGG-3' 
SEQIDNO:35 

5'-ATA ATA AAC CCT ACT AGC TCT AAC AAG-3' 
SEQIDNO:36 

5'-CTG CTA CTT TCA TCC AAA C-3' 
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SEQIDNO:37 

5'- GTA AAA CGA CGG CCA TG -3 ' 
SEQ ID NO:38 

5'- GGA AAC AGC TAT GAC CAT G-3' 
SEQ ID NO:39 

5'- TTC ATG GCA ATG GTG TGA CCC C -3' 
SEQ ID NO:40 

5'- CTG CCG GAT TCT TGA TCG AAG A -3' 
SEQIDNO:41 

5'- AGA GGA AGG TCC GCC TCC GG -3 ' 
SEQ D3 NO:42 

5'- CTC TGC TCT CCT GAG ACT GCT T -3' 
SEQ ID NO:43 

5'- CAT CAT AAT GTC TCC TTT TGA GAC -3' 
SEQ ID NO:44 

5'- GCC ATC CAT TAT TCA TCG CCT -3V 
SEQIDNO:45 

5'- GAG GAG GAA GAA CTG CAG ATT CC -3' 
SEQIDNO:46 

5'- GTG CTC CAT GTC CAA ATC ATA C -3' 
SEQIDNO:47 

10 30 50 

AATATATCGAAACGAGATTCCACyVATTAGTCTCTAGTCAAAGAGCTTCATGGCTUVTGGTG 



70 



90 



110 
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TGACCCCAAATATAGATTTGATGAAAGTGAGGAAATAGGAGAAGAAATGAAGAACACAGG 

130 150 170 

ATGTGTCTTCTTCTTCTAAGTCACTAAC^AAATCAACAAAGAGGAGAAGCCATTATTATA 

190 210 230 

TAATAGAGAGATTGAGAGAAGAGATTTATCCAAAAAAATATTGCAATTCTTCTTGGAGTG 

250 270 290 

AATAATGCCAGTCCCTCTTGCACCATATCCAACACCTCCGGCACCGGCACAGGCTCCGTC 

310 330 350 

GTAO^CACTCCTCCGGCAAATGGAAGTACAAGTGGGCA^ 

370 390 410 

TTGCAGAAACCTTCTGATGTATCCCGTCX3GAGCAACCTCCGTCTGCTGCGCCGTCTGTAA 

430 450 470 

CGCCGTCACGGCCGTTCCTCCGCCGGGAACGGAGATGGCACAGTTAGTATGTGGAGGATG 

490 510 530 

CCATACACTCTTAATGTACATTCGTGGAGCTACAAGTGTTCAATGTTCTTGTTGTCACAC 

550 570 590 

TGTTAATCTCGCCCTCGAAGCGAACCAAGTAGCGCATGTGAATTGCGGAAACTGCATGAT 

610 630 650 

GCTACTAATGTATC^TATGGAGCAAGATCAGTGAAATGTGCCGTTTGTAACTTTGTOVC 

€70 690 710 

ATCTGTTGGGGGTTCAACGAGCACGACTGATTCGAAGTTTAACAATTAAAACTTGGATCT 

730 750 770 

ATCTACCTATC^TATCTATTGAGTTATGAGCAATATAGAGGAAGCATCAAATCTTTTTC 

790 810 830 

ACTCTCTCTTCGATCAAGAATCCGGCAGTTATGAGTTTGAAACCATTTTCGGAAGTAAAT 

850 870 890 

GTU^TATGTAATTCGTCGAAATTTCTGACTTTGGTCTCTTTGTCCGTTTGTATAGAGCTA 

910 

AAAAAAAAAA 



SEQIDNO: 48 



MetProValProLeuAlaProTyrProThrProProAlaProAlaGlnAlaProSerTyr 

10 20 

AsnThrProProAlaAsnGlySerThrSerGlyGlnSerGlnLeuValCysSerGlyCys 
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30 40 

ArgAsnl^uLeuMetTyrProValGlyAl^ 

50 60 

ValThrAlaValProProProGlyThrGluMetAlaGlnLeuValCysGlyGlyCysHis 

70 80 

ThrLexiLeuMetTyrlleArgGlyAlaThrSerValGlnCysSerCysCysHisThrVal 

90 100 

AsnLeiiAlalieuGluAlaAsnGlnValAlaJIisValAsnCysGlyAsnCysMetMetLeu 

HO 120 

LeuMetTyrGlnTyrGlyAlaArgSerValLysCysAlaValCysAsnPheValThrSer 

130 140 

ValGlyGlySerThrSerThrThrAspSerLysPheAsnAsn 

150 

SEQIDNO:49 

CysSerGlyCysArgAsiiLeuLeuMetTy^ 
SEQIDNO:50 

CysGlyGlyCysHisThrLeuLeuMetTyrlleArgGlyAlaThrSerValGlnCysSerCysCys 

SEQIDNO:51 

CysGlyAsriCysMetMetLeuIieiiMetTyrGlnTyrGlyAlaArgSerValLysCy^ 

SEQIDNO:52 

CysXxxXxxCYsArgXxxXxxLeu^^ 

SEQmNO:53 

CysXxxXxxCysXxxXxxLeu^^ 

SEQIi)NO:54 



10 30 so 

GAGGAGGAAGAGGAAGGTCCGCCTCCGGGATGGGAATCTGCAGTTCTTCCTCCTC(^ATC 

70 90 no 

GTCACCATCACCGCCGCCGTAAACCCCAATCCCACCACCGTAGAAATTCCCGAAAAGGCC 



WO 98/37755 

14 



46 



PCT/US98/04077 



130 150 170 

CAAATGGTATGTGGATCTTOCAGGCGTTTGCTTTCTTATCTAAGAGGATCCAAACATGTT 

190 210 230 

AAGTGCTCCTCTTGTCAGACTGTTAATCTCGTTCTTGAAGCTAACCAGGTTGGTCAGGTG 

250 270 290 

AATTGCAACAATTGCAAACTGCTACTGATGTATCCTTATGGAGCTCCAGCTGTTAGATGT 

310 330 350 

TCCTCCTGCAATTCTGTCACAGATATCAGTGAAAACAACAAACGACCTCCATGGTCTGAG 



370 390 



410 



CAGCAAGGACC7VCTCAAAAGTTTAAGCAGTCTCAGGAGAGCAGAGAATTAAACTTGAACC 
430 450 47Q 

GATTTTTGTCAATTTTGAACCGGTTTGACGACTAAAAACCTTGTAATAATGTCGAAGGAT 
490 

AGATGAAATAAAATCACACC 



SEQ ID NO:55 



GluGluGluGluGluGlyProProProGlyTrpQluSerAlaValLeuProProProIle 

10 20 

ValThrlleThrAlaAlaValAsnProAsnProThrThrValGlulleProGluLysAla 

30 40 

GlnMetValCysGlySerCysArgArgLeuLeuSerTyrLeuArgGlySerLysHisVal 

50 60 

LysCysSerSerCysGlnThrValAsnLeuValLeuGluAlaAsnGlnValGlyGlnVal 



70 80 



AsnCysAsnAsnCVsLysLeuLeiiLeuMetTyrProTyrGlyAlaProAlaValArgCys 



90 100 



SerSerCysAsnSerValThrAspIleSerGluAsnAsnLysArgProProTrpS. 



erGlu 

110 120 



GlnGlnGlyPro3JeuI.ysSerLeuSerSerLeuArgArgAlaGluAsn 
55 

SEQIDNO:56 

CGSCRRLLSYLRGSKHVKCSSC 



15 
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SEQIDNO:57 

CNNCKLLLMYPYGAPAVRCSSC 
SEQID NO: 58 



10 30 50 

GGAAGAGATACAACAACAAACGCAGAAGGAAGAACAAAAGCA 

70 90 no 

GGAAGAGGAAGGTCCGCCTCCGGGATGGGAATCTGCAGTTCTTCCTCCTCCAATCGTCAC 

130 150 170 

CATCACCGCCGCCGTAAACCCCAATCCCACCACCGTAGAAATTCCCGAAAAGGCCCAAAT 

190 210 230 

GGTATGTGGATCTTGCAGGCGTTTGCTTTCTTATCTAAGAGGATCCAAACATGTTAAGTG 

250 270 290 

CTCCTCTTGTCAGACTGTTAATCTCGTTCTTGAAGCTAACCAGGTTGGTCAGGTGAATTG 

310 330 350 

CAACAATTGCAAACTGCTACTGATGTATCCTTATGGAGCTCCAGCTGTTAGATGTTCCTC 

370 390 410 

CTGCAATTCTGTCACAGATATCAGTGTATGTATTCACAGATGGTTTTGTGCTCCATGTCC 

430 450 470 

AAATCATACTTGGAAGAGTTGATACATTTTGAGATCCGAGTAAGTAATCATCTGATGAAT 

490 510 530 

CATTTATAATAAACTGTGTTATATTT(^GGAAAACAACAAACGACCTCCATGGTCTGAGC 

550 570 590 

AGCAAGGACCACTCAAAAGTTTAAGCAGTCTCAGGAGAGC^ 

610 630 650 

ATTTTTGTCAATTTTGAACCGGTTTGACGACTAAAAACCTTGT 

670 690 
GATGAAATAAAATCACCATTAATAATCTAAAAAAAAAAAAAAAA 



SEQIDNO:59 



10 30 50 

CTCTATCCTTACTTCAACGGAGCTTTACCAGACCCAAACTCTOTTAGGCTO 
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70 90 110 

TTGTTTGTACGTGTGCITAACGCAGATTACATATGACGCTTOTAACCCaCAATTAATTTG 

130 iso 170 

GTTCACTCTTTGCCGCAAACCAAATAGCTCAAAAAAGATTTTAATCCCAATTT 

190 210 
TAAATGTGCATCATGGTCGGATAGTGTAGTGGCTGTTGGTCCTAATATCTACGCTATTGG 
250 270 29Q 



GGGATTCAGTAATAATAGAACTAAACCTTCGTCTAGCGTCATGGTi 



CATGGATTGTCGTAC 
310 330 350 

TCACACATGGTGTGAGGCCCCTAGCATGCAGGTTTCCCGTGTGTTCCAATCTACTTGCGT 

370 390 410 

CCTTGATGGGAAAATATATGTAACAGGAGGCCGCGGAACTCTCGATTCAACGAAATGGAT 

430 4 5 o 47Q 

GGAGGTTTTTGATACGAAAACCCAAACTTGGGAGTTTTTGCAATTCCCGAGTGAGGAGAA 

490 510 530 

GATATGCACAGGCTATAAGTGTGAGAGCATAGTGTATGAAGGAACTGTCTATGTAAGGTC 



550 S70 590 

CTTT 



GTATTTTCATAATGTGACTTACAAGCTGCATAAAGGTAGATGGATTCAGCGGCAGA! 



610 630 6 5 0 

AGGCGATGAATAATGGATGGCCGTTGCTCATCATTTTTTTGTGTGATAAAGAACGTGTTC 

670 690 710 

TACTTGTTGCAATAGAAGTGGTAACGGTATGATCGATTGGTATGACTCGGAAAAAGGATC 

730 750 770 

ATGGACAACTATGAAGGGGTTGGAAAGATTGCCTAAAGTTTATGGTAATGTTAAATTGGC 

790 810 8 3o 

ATATTATGGTGGAAAAATGGTGGTGGTGTACGTGGAGTGCTAAGGAGTGGGGTAACGTGA 

850 870 890 

GAAAAATTTGGTGTGCGGAAATTACGATTGAAAAACGCAAGGATGGAGAGATTTGGGG 



rGA 

910 930 950 

CTGTAG 



TACTAGAATGGTTTGACGATGTATATAAAGCCAAGGATGAGCTAGAATATTTAG 



970 990 1010 

TGCATGCTGTTGTTACTACCATCTGATTGATAAGAGAGTCATGTGAACATTGTTCATTGA 

1030 1050 1070 

TTCAC^TGC^TAACGAATTTATCTACTATCATTTGTTTTGATTTTCTTTCTAAATCT 

n™. 1090 1110 H30 

TTrrTGTTTGTTCTTGTATTGAATTTTACCTTACATTTATTAAGAAAGTCAACTATTTGT 
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1150 1170 H90 

CAACGTTACTGGAAAGTTAAAAAGGTAAAAGTAATAATAATCTGAGAGTTAACTTTGGAC 

1210 1230 1250 

AT CTTCG C CGGAG CCGAG ACGGAAGGCGTGATGG AAGAGATACAACAACAAACG CAG AAG 

1270 1290 1310 

GAAGAACAAAAGCACCGTGAAGAAGAAGAGGAGGAAGAGGAAGGTCCGCCTCCGGGATGG 

1330 1350 1370 

GAATCTGCAGTTCTTCCTCCTCCAATCGTCACCATCACCGCCGCCGTAAACCCCAATCCC 

1390 1410 1430 

ACCACCGTAGWIAATTCCCGK5TATTCTTGTAGTCTTGTCTATTTTAGGGTCT 

1450 1470 1490 

CTTCCATTTCTTGCTACAGTCTGATCAAAT^ 

15X0 1530 1550 

TAGAGATAACCCATTTTCGATTCCGAGAATTGATTAGTGT TT T T I' TO CTGCIAAATCTTCT 

1570 1590 1610 

TTGTTTTTGGGGTTGTTGCAGAAAAGGCCCAAATGGTATGTGGATCTTGCAGGCGTTTGC 

1630 1650 1670 

TTTCTTATCTAAGAGGATCCAAACATGTTAAGTGCTCCTCTTGTCAGACTC 

1690 1710 1730 

TTCTTGAAGGTTCGTTCTTCCATGGCTTTTTTATCTCTTATTCATTACT^ 

1750 1770 1790 

TGTTGATAATCTCAGTCACTTGAAACTCTTAATGGAACAATCTTGGAATGCTCTCTCAGT 

1810 1830 1850 

CTAGTTTTACTTAGC&TGTGTGAATGATATATC 

1B70 1890 1910 

AAGCTTCCTGAGACCAAATGAGTTTAGTTCTTAACTGACACAAGAATGATCT^ 

1930 1950 1970 

GATTCTTCTCTTAAGCTTTTGTGAGCCTTTTGGTCTCTACTCCATC^ 

1990 2010 2030 

GTAGACCATTTATGTGGTCTTTATCCTTTACT 

2050 2070 2090 

CTTAAGACCAAGATTGTTCTTCTTAGCTTC 

2110 2130 2150 

AGCCTTCTTCTCTTATCGGTTCTGGAC^ 

2170 2190 2210 

TCAGGTGAATTGCAACAATTGCAAACTGCTACTGATGTATCCTTATGGAGCTCCAGCTGT 

2230 2250 2270 
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TAGATGTTCCTCCTGa^TTCTGTCACAGATATCAGTGTATGTATTCACAGATGGTTTTG 

2290 2310 2330 

TGCTCCATGTCCAAATCATACTTGGAAGAGTTGATACATTTTGAGATCCGAGTAAGTAAT 

2350 2370 2390 

CATCTGATGAATC^TTTATAATAAACTGTGTTATATTTCAGGAAAAC^ACAAACGACCTC 

2410 2430 2450 

CATGGTCTGAGCAGCAAGGACCACTCAAAAGTTTAAGCAGTCTCAGGAGAGCAGAGAATT 

2470 2490 2510 

AAACTTGAACCGATTTTTGTCAATTTTGAACCGGTTTGACGACTAAAAACCTTGTAATAA 

2530 2550 2570 

TGTCGAAGGATAGATGAAATAAAATCACCATTAATAATCTCATTGAATTCCCATTCTTTC 

2590 2610 2630 

AGATATTACTTGCTCATCATCCTTTACTGTTTTAAGCTTTAGTGGTTAAAAAGAATGTGT 

2650 2670 2690 

ATATATCCATACAAAAGTTGATATATGTACTGGACCAATATAAACAAACAC^GCTCACAG 

2710 2730 2750 

TCTCACACAATACATAAAAACAAAATTCATATTTCACAGGTGAGAAAAACTAACTAGTAG 

2770 2790 2810 

TCTACTTGGCCGAATTTGTCAATGAATTTCAATAATTAGGTCGTATAAATAGCAAACAAA 

2830 2850 2 870 

ACATGGACTCTTACCCAACCAAATATGCATAAATAATTTACATTACAGTTTCATATAAAA 

2890 2910 2930 

TACAAACTAATGGTGGGTCCTCGAGAGAGCTAACAAGAGCTGTGTGTGGGTGAAGAACCA 



2950 2970 
ACTTGTCAACGAAACCAATTTAATGGAAATCAACCOTAAATTTAATGAAACCTTGGAI 



2990 

.CGA 



3010 3030 3 050 

AACTTACATTTTGTTAACCAGTTTATCCTTTTAAATCAAACC^ 

S£QIDNO:60 

MetGluGluIleGlnGlnGlnThrGlnLysGluGluGlnLysHisArgGluGluGluGlu 

10 20 

GluGluGluGluGlyProProProGlyTrpGluSerAlaValLeuProProProIleVal 

30 40 

ThrlleThrAlaAlaValAsnProAsnProThrThrValGluIleProGluLysAlaGln 

50 60 

MetValCysGlySerCysArgArgLeuLeuSerTyrLeuArgGlySerLysHisValLys 

70 80 
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CysSerSerCysGlnThrValAsnLeuValLeuGluAlaAsnGlnValGlyGlnValAsn 

90 ioo 

CysAsnAsnCYsLysLeuLeuLeuMetTyrProTyr^ 

HO 120 

SerCysAsnSerValThrAspIleSerGluAsnAsnLysArgProProTrpSerGluGln 

130 140 

GlnGlyProLeuLysSerLeuSerSerLexiArgArgAlaGluAsn 

150 



SEQ ID NO:61 



MetGluGluIleGlnGlnGlnThrGlnLysGluGluGlnLysHisArgGluGluGluGlu 

10 20 

GluGluGluGluGlyProProProGlyTrpGluSerAlaValLeuProProProIleVal 

30 40 

ThrlleThrAlaAlaValAsnProAsnProThrThrValGluIleProGluLysAlaGln 

50 60 

MetValCysGlySerC^sArgArgLeuLeuSerTyrLeuArgGlySerLysHisValLys 

70 80 

CysSerSerCysGlnThrValAsnLeuValLeuGluAlaAsnGlnValGlyGlnValAsn 

90 ioo 

C^sAsnAsnC^sLysLeiiLeuLeuMetTyrProTyrGlyAlaProAlaValArgCysSer 

HO 120 

SerCysAsnSerValThrAspIleSerValCysIleHisArgTrpPheCysAlaProCys 

130 140 

ProAsnHisThrTrpLysSer 



SEQIDNO:62 

CysXxxXxxCysXxxXxxLeuLeuX^ 

SEQ ID NO: 63 



I^uValCysHisGlyCysArgAsnLeuLeuMetT^ 
snThrlleAsnMetVal 

IlelleCysGlyGlyCysArgThrMetLeuMetTyrThrArgGlyAlaSerSerValAr 
InThrThrAsnLeuVal 
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IleAsnCysGlyHisCysArgllirThrL^ 
InPheValThrAsnVal 



SEQIDNO:64 

LeuValCysSerGlyCysArgAsnl^uL^ 

snAlaValThrAlaVal 

LeuValCysGlyGlyCysHisTteLeuLe^^ 

isThrValAsnLeuAla 

ValAsnC^sGlyAsnCysMetMetLeuI,euMe^ 

snPheValThrSerVal 



SEQ ID NO: 65 



MetValCysGlySerCysArgArgLeuI^uSerTy^ 
lnThrValAsnLeuVal 
ValAsnCysAsnAsnC^sLysLeuI^u^^ 
snSerValThrAspIle . 



SEQ ID NO: 66 

Nucleic acid sequence of C 



10 30 50 

AGCAACAACAAC^I^CCAGCAACCACC^CCAAC^ 

70 90 no 

GTCACAACCGTAATCCCTCCTCCACCATCTGGATCTGCATC^TAGTC^CCGGAGGAGGA 

130 150 170 

GCGACATACCACCACCTCCTCCAGCAACAACAGCAACAGCTTCAAATGTTCTGGACATAC 

190 210 230 

CAGAGACAAGAGATCGAACAGGTAAACGATTTCAAAAACCATCAGCTCCCTCTAGCTCGT 

250 270 290 

ATCAAAAAAATCATGAAAGCTGATGAAGATGTGCGTATGATCTCCGCCGAAGCACCGATT 

310 330 350 

CTCTTCGCGAAAGCTTGTGAGCTTTTGATTCT 

370 390 410 

GCTGAAGAGAACAAACGTCX3TACGOTTCAGAAAAACGATATCGCTGCTC 

430 450 470 

ACCGATATCTTCGATTTCCTTGTTGATATTGTTCCTAGGGAAGAGATCAAGGAAGAGGAA 

490 510 530 

GATGCAGCATCGGCTCTTGGTGGAGGAGGTATGGTTGCTCCCGCCGCGAGCGGTGTTCCT 
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550 570 590 

TATTATTATCCACCGATGGGACAACCGGCGGTTCCTGGAGGGATGATGATTGGAAGACCG 

610 630 650 

GCGATGGATCCTAGCGGTGTTTATGCTCAGCCTCCT^ 

670 690 710 

CAGAATTCAGCTGGTGGTGGTGATGATGTGTCTTATGGAAGTGGAGGAAGTAGCGGCCAT 

730 750 770 

GGTAATCTCGATAGCCAAGGTTGAGCTATGKSAACCAGA^ 

790 810 830 

TTCGACC CTGCAAGTGT CTGATTCTTATATGTCTATGATTCGAATGACTTA 



SEQroNO:67 

Amino acid sequence of C 



S er AsnAsnAsnAsnAsnGlnGlnPr oPr oPr oThr S erVal Tyr Pr oPr oGlyS e r Al a 

10 20 

ValThrThrVallleProProProProSerGlySerAlaSerlleValThrGlyGlyGly 

30 40 

MaThrTyrHisHisLeiiLeuGlnGlnGlnGlnGlnGlnieuGlnMetPheTrpThrTyr 

50 60 

GlnArgGlnGluIleGluGlnValAsnAspPheLysAsnHisGlnLeuProLeuAlaArg 

70 80 

IleLysLysIleMetLysAlaAspGluAspValArgMetlleSerAlaGluAlaProIle 

90 ioo 

IieuPheAlaLysAlaCysGluLeuPhelleLeuGluLeuThrlleArgSerTrpIxeiiHis 

110 120 

Al aGl uGl uAsnLy s Ar gArgThr LeuGlnLy s AsnAsp 1 1 eAl aAl aAl all eThr Arg 

130 140 

ThrAspIlePheAspPheLeuValAspIleValProArgGluGluIleLysGluGluGlu 

150 160 

AspAlaAlaSerAlalieuGlyGlyGlyGl^etValAlaProAlaAlaSerGlyValPro 

170 180 

TyrTyrTyrProProMetGlyGlnProAlaValProGlyGlyMetMetlleGlyArgPro 

190 200 



AlaMetAspProSerGlyValTyrAlaGlnProProSerGlnAlaTrpGlnSerValTrp 

210 220 
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GlnAsnSerAlaGlyGlyGlyAspAspValSerTyrGlySerGlyGlySerSerGlyHis 

230 240 

GlyAsnLeuAspSerGlnGly 

S£QIDNO:68 

Nucleic acid sequence of CC 



10 30 50 

AGTATC^TGAGCTTTCAGAAGCTTCTCAGATACTCACATGTTGCTCTGACATGGTGTAC 

TGCACGGTTTGCGCATGTATGCAGACACAACACAAGATGGAAATGGACAAGAGGGACGGT 

130 iso 1?0 

AAGTTCGGGCCACAGCCAATGGCAGTGCCTCCGGCTCAGCAAATGTCACGGTTTGATCAA 

190 210 230 

GCCACCCCACCCGCAGTCGGTTATCCTCCACAACAAGGTTATCCACCTTCTGGTTATCCT 

250 270 290 

CAACACCCTCC^C^GGTTATCC^CCTTCTGGCTATCCTCAAAACCCTCCTCCCTCAGCT 

310 330 35 0 

TATTCTCAATACCCTCCTGGGGOTATCCTCCTCCTCCCX3CTTACCCAAAGTGATCACTC 

370 390 410 

TTTGCCTGTTTTCTCTCCCGATTGGAAAATTTTATTTCATCTTTTTTTAATGCTGTCTTG 

430 450 470 

TTACGGGTCAAGAATTGAACGTTCGCTGATTGTTTTGAGGTCGTTGTTTGTATGAGATTT 

490 510 5 3o 

TGACOTCGCATGTTGTTGTTGTTTTCTGAAACGTCCCTCTTGGACTAAGAGATTTCATGA 

550 

CTTAAAAAAAAAAAAAAAAAA 



SEQIDNO:69 

Amino acid sequence of CC 

SerMetAspGluLeuSerGluAlaSerGlnlleLeuThrCysCysSerAspMetValTyr 

10 20 

C^sThrValCysAlaCysMetGlnThrGlnHisLysMetGluMetAspLysArgAspGly 
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30 40 

LysPheGlyProGlnProMetAlaValProProAlaGlnGlnMetSerArgPfaeAspGln 

50 60 

AlaThrProProAlaValGlyTyrProProGlnGlnGlyTyrProProSerGlyTyrPro 

70 80 

GlnHisProProGlnGlyTyxProProSerGlyTyrProGlnAsnProProProSerAla 

90 100 

TyrSerGlnTyrProProGlyAlaTyrProProProProAlaTyrProLys 

110 



SEQ ID NO: 70 

Nucleic acid sequence of FF 



10 30 50 

AGGTTTCCGACGTTGATGACCCAATTTCCGTCGTCGACGAAGACGATTCCGGCATCGTAT 

70 90 110 

TTGCTTCCGTTACAATGGCCTCAGCCGCAGAACGAGGAGATTCTTCTCGCCATGGAAGAA 

130 150 170 

GCTGAGTTCGAAGAAAAGTGCAACGAGATCAGAAAGATGAGTCCTGCTTTACCGGTAATT 

190 210 230 

GGAAAACCAGTCGTCAACAACGAACAAGAAGAGGATGATAATGAAT 

250 270 290 

GCAGATAATGCAGAGGAATCAGATGGTGAAGAGTTTGAGCAAGAAACCGGATAAATAATC 

310 330 350 

TTGAGGCCGAAAATACACAAGGGTTATTGATGGCATT 

370 390 410 

TCTAAATCTTCTTGTGATAAAACGACTGTGATTCTGACTTTC 

430 450 470 

TTTCTTAGGAACGACTGAAATGTTCACTTTT^ 

490 510 530 

TAGTTAATTGTCTCAATCTCATGGTGTTAATTGTGTT^ 

550 570 
GGTTTATAGATTGTAGTGATTTGATGAAAAAAAAAAAAAAAAAAAAA 
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SEQ ID NO: 71 

Amino acid sequence of FF 



ArgPheProThrLeuMetThrGlnPheProSerSerThrLysThrlleProAlaSerTyr 

10 20 

LeiiLeuProLeuGlnTrpProGlnProGlnAsnGluGluIleLeuLeuAlaMetGluGlii 

30 40 

AlaGluPheGluGluLysCysAsnGluIleArgLysMetSerProAlalieuProVallle 

50 60 

GlyLysProValValAsnAsnGluGlnGluGluAspAspAsnGluSerGluAspAspAsp 

70 80 

AlaAspAsnAlaGluGluSerAspGlyGluGluPheGluGlnGluThrGly 

90 



SEQ ID NO: 72 

Nucleic acid sequence of GG 



10 30 50 

AGGGAAACAATGAGCCAGTACAATCAACCTCCCGTTGGTGT^ 

70 90 110 

CCACCGGAGGGATATCCAAAAGATGCTTATCCACCACAAGGATATCCTCCTCAGGGATAT 

130 150 170 

CCTCAGCAAGGCTATCCACCTCAGGGATATCCTCAACAAGGTTATCCTC^GC^GGATAT 

190 210 230 

CCTCCACCGTACGCGCCTCAATATCCTCCACC^ 

250 270 290 

CCTGGCTTTCTAGAAGGATGTCTTGCTGCTCTGTGTTGTTGCTGTCT 

310 330 350 

TTCTGATTGGAGTCTCTCTCTCTCTGCATAAAGCT^ 

370 390 410 

TGGTTAAACAAAAACCTTAATTGATTTGTGGGGCATTAAAAATGAATCT 

430 450 470 

TCTTTCGTTTTATGTGTAATGTTCTTCGGTTCATAACATTTTAA 



490 510 530 

TTCTGCCTTAGTTTGTATTTGATTATGGGAATGTAAATTGGTTGGGAGACACTATT 
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550 570 
GCCATAGTTTATTGCTTGGATCTTCAAAAAAAAAAAAAAAA^ 



SEQIDNO:73 

Amino acid sequence of GG 



ArgGluThrMetSerGlnTyrAsnGlnProProValGlyValProProProGlnGlyTyr 

10 20 

ProProGluGlyTyrProLysAspAlaTyrProProGlnGlyTyrProProGlnGlyTyr 

30 40 

ProGlnGlnGlyTyrProProGlnGlyTyrProGlnGlnGlyTyrProGlnGlnGlyTyr 

50 60 

ProProProTyrAlaProGlnTyrProProProProGlnHisGlnGlnGlnGlnSerSer 

70 80 

ProGlyPheLeuGluGlyCysLeiiAlaAlaLeuCysCysCysCysLeiiLe\iAspAlaCys 

90 100 

Phe 



SEQroNO:74 

Nucleic acid sequence of HH 



10 30 50 

AGTGATGTTCTTCCrAAGTCCGTTGACTGGAGAAACGAAGGCGCA 

70 90 110 

GATCAAGGCCTTTGCAGGAGTTGTTGGGCTTTCT 

130 150 170 

AACAAGATTGTGACTGGAGAGCTAGTAACTTTGTCTGAGCAAGATTTGATC^ 

190 210 230 

AAAGAAAACAATGGTTGCGGAGGAGGCAAAGTCGAGACA^ 

250 270 290 

AATGGTGGTCTTGGTACCGACAACGATTATCCTTACAAAG 

310 330 350 

GGCCGCCTCAAGGAAGACAACAAGAATGTTATGATTGATGGGTATGAGAATTTGCCTG^ 



370 390 410 

AACGATGAAGCCGCTCTCATGAAAGCGGTTGCTCACC^GCCTGTGACTGCCGTTGTCGAT 
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430 450 470 

TCCAGCAGCCGAGAGTTTCAGCTTTATGAATCGGGAGTGTTTGACGGAACTTGCGGAACA 

490 510 530 

AACCTAAACCATGGTGTTGTTGTGGTCGGGTATGGAACCGAGAATGGTCGTGACTACTGG 

550 570 590 

ATTGTGAAAAACTCGAGGGGCGACACATGGGGGGAGGCTGGCTACATGAAGATGGCTCGC 

610 630 650 

AACATTGCCAATCCAAGAGGCATATGTGGCATCGCAATGCGAGCTTCATACCCTCTCAAG 

670 690 710 

AACTCGTTTTCTACGGATAAAGTTTCGGTTGCCTAATAATATGAACTAAATGTATGCCAT 

730 750 770 

GGAACGGATCGGTTAAGCCATTATCGTTATTCGACTTTGAAGGAAACTAAAAAATAATGT 

790 810 830 

GGTCGATTGGTTTGGTT1TGTTATATATTATGCATTTGTATGGGGGTCAGTCAATGTTTG 

850 870 890 

AACTTTGTATAATATTTCTTTGGGTCTAGTGATAAATATTTTCCCnTTTGCGAAAAAAAA 

910 

AAAAAAAAAA 



SEQ ID NO: 75 

Amino acid sequence of HH 



SerAspValLeuProLysSerValAspTrpArgAsnGluGlyAlaValThrGluValLys 

10 20 

AspGlnGlyl^uCysArgSerCVsTrpAlaPheSerThrValGlyAlaValGluGlyLeu 

30 40 

AsnLysIleValThrGlyGluLeuValThrLeuSerGluGlnAspLeuIleAsnC^sAsn 

50 60 

LysGluAsnAsnGlyCysGlyGlyGlyLysValG^ 

70 so 

AsnGlyGlyLeuGlyThrAspAsnAspTyrProTyrLysAlaLeuAsnGlyValCysGlu 

90 loo 

GlyArgLeuLysGluAspAsnLysAsnValMetlleAspGlyTyrGliiAsnLeuProAla 

HO 120 

AsnAspGluAlaAlalieuMetLysAlaValAla^ 

130 140 
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SerSerSerArgGluPheGlnLeuTyrGluSerGlyValPheAspGlyThrCysGlyThr 

150 160 

AsnLexoAsnHisGlyValValValValGlyTyrGlyThrGluAsnGlyArgAspTyrT^ 

170 180 

IleValLysAsnSerArgGlyAspThrTrpGlyGliiAlaGlyTyrMetLysMetAlaArg 

190 200 

Asnl 1 eAl aAsnPr oArgGly 1 1 eCysGly 1 1 eAl aMe tArgAl aS er Tyr Pr oLeuLys 

210 220 

AsnS erPheSer ThrAspLysVal SerValAla 

230 



SEQIDNO:76 

Nucleic acid sequence of I 



10 30 50 

AGCG&AATGCC^GTTTCAGCTCCATCTCCGCCTC^ 

70 90 110 

CCCATCAATTTCACTCCTTCTTCTTTCTC 

130 150 170 

TATCCCCGAATCAAAGCTGAACTCGATCCCAACACGGTAGTCGCGATATCTGTAGGCGTA 

190 210 230 

GCAAGCGTCGCATTAGGAATCGGAATCCCTGTGTTCTACGAGACTCAAATCGACAATGCG 

250 270 290 

GCTAAGCGAGAGAATACTCAACCTTGTTTTCCCTGTAATGGCACCGGAGCrrCAGAAATGC 

310 330 350 

AGATTGTGTCTGGGAAGTGGTAATGTGACOSTAGAGCITGGTGGA 

370 390 410 

TCAAACTGTATCAACTGTGATGGTGCTGGTTC CTTAACTTGCACTACTTGTCAAGGCTCT 

430 450 470 

GGTGTTCAACCTCGATACCTTGATCGAAGGGAGTTOUVGGACGATGACTAAATACCTTGC 

490 510 530 

TCTAAGGAACATTTCTTTTCTTCTCCCTTCTCACATTTCT 

550 570 590 

GTTCACCAAACATGTTGAGAGAAC^TCATGACATGGATATTGTAATTGTGAAAGAAAACC 



610 



63.0 



650 
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ACCA^GTTC^TCAAATGTTTCTTCTTGTACTTAAAAAAAAAAAAAAAAA^ 

SEQ ID NO: 77 

Amino acid sequence of X 



SerGluMetProValSerAlaProSerProProArgLeuHisSerProPhelleHisCys 

10 20 

ProIleAsnPheThrProSerSerPheSerAlaArgAsnLeuArgSerProSerThrSer 

30 40 

TyrProArglleLysAlaGluLeuAspProAsnThrValValAlalleSerValGlyVal 

50 60 

AlaSerValAlaLeuGlylleGlylleProValPheTyrGluThrGlnlleAspAsnAla 

70 80 

AlaLysArgGl\iAsnThrGlnProCysPheProCysAsnGlyThrGlyAlaGlnLysCys 

90 ioo 

ArgLeuCysValGlySerGlyAsnValThrValGluLeuGlyGlyGlyGluLysGluVal 

HO 120 

SerAsnCysIleAsnCysAspGlyAlaGlySerLeuThrCysThrThrCysGlnGlySer 

130 140 

GlyValGlnProArgTyrLexiAspArgArgGluPheLysAspAspAsp 

150 



SEQ ID NO: 78 

Nucleic acid sequence of II 



10 30 50 

AGAGAAAACATGGGAGGTGACAATGATAATGACAAAGA 

70 90 no 

CCCGCTGGATACCCACCCCCTGGGGCTTATCCACCCGCTGGATACCCACA^ 

130 150 170 

CCTCCACCACCCGGTGCTTACCCGCCTGCAG 

190 210 230 

CCTGGTGGTTATCCTCCCGCCCCrroGTTATGGTGGTTATCCTCCAGCTCCTGGTTATGGA 



250 



270 



290 
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GGTTATCCTCCTGCACCTGGTCATGGTGGTTACCCTCCTGCTGGCTATCCTGCTCATCAC 

310 330 350 

TCAGGACACGCAGGAGGAATTGGGGGTATGATTGCAGGTGCTGCAGCTGCCTATGGAGCT 

370 390 410 

CACCACGTATCTCATAGCTCTCACTGTCCTTACGGAC^ 

430 450 470 

GGCCATGGTC^TGGCTATGGCTATGGTC^TGGTCATGGTAAGTTCAAGCATGGGAAGCAC 

490 510 530 

GGGAAGTTCAAGCATGGGAAGC^TGGAATGTTTGGAGGAGGCAAGTTC^AGAAGTGGAAG 

550 570 590 

TGATCTAGCTATTACCTTGTGTGAATTTGTCTGGAOT 

610 630 650 

AAACATTATATAAGTTGACTTTCGTCGGTTAGATTGCTGGTTCGAGTTGGAATAATTGAA 

670 690 710 

ACTTAATTAGTATCAAATCTTATTGTGTACTTO^ 

730 750 770 

GATTCTGGTTTCGGTGTGTTGTTTTAAGATTTTTGTATATACTGTTTTTTACATTGCTTA 

790 810 
AGCTTATAGAAGTCATGATTATGATTAAAAAAAAAAAAAAAAAA 



SEQ ID NO: 79 

Amino acid sequence of II 



ArgGlxiAsiiMetGlyGlyAspAsnAspAsnAspLysAspLysGlyPheHisGlyTyrPro 

10 20 

ProAlaGlyTyrProProProGlyAlaTyrProProAlaGlyTyrProGlnGlnGlyTyr 

30 40 

ProProProProGlyAlaTyrProProAlaGlyTyrProProGlyAlaTyrProProAla 

50 60 

ProGlyGlyTyrProProAlaProGlyTyrGlyGlyTyrProProAlaProGlyTyrGly 

70 80 

GlyTyrProProAlaProGlyHisGlyGlyTyrProProAlaGlyTyrPrpAlaHisHis 

90 ioo 

SerGlyHisAlaGlyGlylleGlyGlyMetlleAlaGlyAlaAlaAlaAlaTyrGlyAla 

HO 120 



HisHisValSerHisSerSerHisCysProTyrGlyHisAlaAlaTyrGlyHisGlyPhe 
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130 140 

GlyHisGlyHisGlyTyrGlyTyrGlyHisGlyHisGlyLysPheLysHisGlyLysHis 

150 160 

GlyLysPheLysHisGlyLysHisGlyMetPheGlyGlyGlyLysPheLysLysTrpLys 

17 ° 180 



SEQ ID NO: 80 

Nucleic acid sequence of K 



10 30 so 

AGTGTC^CTACTCCATCCGAGG^ 

70 90 110 

ACACCGAACCAGCGAACCAGAGTTCCCGTGAGTCy^TTCGCGCOjCCGAATTATCAGCAA 

130 150 170 

GCTAATGTTAACCTATCTGTTGGGAGGCCATGGAGCACTGGTTTGTTTGATTGTCAAGCA 

190 210 230 

GACCAAGCCAATGCCGTTTTGACCACAATTG 

250 270 290 

GAAGTGATGGATGAAGGAGAGATGACTTGTCCTCTTGGAACTTTCATGTACTTATTGATG 

310 330 350 

ATGCCGGCTTTATGCTCT(^CTGGGTGATGGGAT(^^GTATAGAGAAAAAATGAGGAGA 

370 390 410 

AAATTT7^TCTTGTGGAAGCTCCATATTCAGATTGTGCCAGTCATGTCCTATGCCCTTGT 

430 450 470 

TGCTCTCTTTGTCAAGAATACAGAGAGCTCAAGATTAGGAATCTTGATCCT 

490 510 530 

TGGAATGGGATACTTGCTCAAGGACAAGGACAATA 

55 <> 570 590 

CCTACAAATCAATATATGTCTAAGTAAACATTTGATTTTAGTTGACTTC 

610 530 650 

AAACATTATTTGTGGACCATTGTACAATGAAAGT^ 

670 690 
GTGTGAGATTGATAAAAAAAAAAAAAAAAAAA 



SEQ ID NO: 81 
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Amino acid sequence of K 

SerValThrThrProSerGluGlxiAspSerAsnAsnGlyLeuProValGlnGlnProGly 

10 20 

ThrProAsnGlnArgThrArgValProValSerGlnPheAlaProProAsnTyrGlnGln 

30 40 

AlaAsnValAsnLeuSerValGlyArgProTrpSerThrGlyLeuPheAspCysGlnAla 

50 60 

AspGlnAlaAsnAlaValLeuThrThrlleValProCysValThrPheGlyGlnlleAla 

70 80 

GluValMetAspGluGlyGluMetThrCysProLeuGlyThrPheMetTyrLeuLetiMet 

90 100 

MetProAlaLeuCysSerHisTrpValMetGlySerLysTyrArgGluLysMetArgArg 

110 120 

LysPheAsnlieuValGl uAl aProTyr Ser AspCysAl aS erHis Val LeuCy s Pr oCys 

130 140 

CysSerlieuCysGlnGluTyrArgGluIieiiLysIleArgAsnLeuAspProSerLeuGly 

150 160 

TrpAsnGlylleLeuAlaGlnGlyGlnGlyGlnTyrGluArgGluAlaProSerPheAla 

170 180 

ProThrAsnGlnTyrMetSerLys 



SEQIDNO:82 

Nucleic acid sequence of M 

10 30 50 

AGAAAATACGAAAAGGTCTCCCTCCCAGC^CCTTACGTGGCT 

70 90 110 

GAAGACGACGGTCAATACTATCCCGGCAAATACGAAAAAGCC^ 

130 150 170 

GTGGCCGGATATCCGAGCCAtCATGAAGACGATGGTCAATACTATCCTGGCAAATACGAA 

190 210 230 

AAGGTCTCCCTCCCAGCACCTTACGTGGTCGGACACCCGAGCCACTCCGAAGATGATGGC 

250 270 290 

CAATACTATCCCGGCAAATACGAAAAGGCCTCCGTCCCATCAGCTTAOSTGGCCGAAC^C 

310 330 350 

TCGAGCCACTCCGAAGATGATGGCC/^TACTATCCTGGCAAATACGAAAAGCCCGAACAC 
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370 390 410 

CATTACTGAAAACTCTCACAC^CAATGATTCTC^TCCTTCCGTAGTCTTTTAATTCGAC 

430 450 470 

TTTT7JVCAATAAAAACGTGATCTTAATTTTTCATCAAAAAAAAAAAAAAAAAAA 



SEQroNO:83 

Amino acid sequence of H 



ArgLysTyrGluLysValSerLeuProAlaProTyrValAlaGlyHisSerSerHisHis 

10 20 

GluAspAspGlyGlnTyrTyrProGlyLysTyrGluLysAlaSerLeuProAlaProTyr 

30 4 0 

ValAlaGlyTyrProSerHisHisGliiAspAspGlyGlnTyrTyrProGlyLysTyrGlu 

50 60 

LysValSerLeuProAlaProTyrValValGlyHisProSerHisSerGluAspAspGly 

70 80 

GlnTyrTyrProGlyLysTyrGl^ 

90 100 

SerSerHisSerGluAspAspGlyGlnTyrTyrProGlyLysTyrGluLysProGluHis 

HO 120 

HisTyr 



SEQIDNO:84 

Nucleic acid sequence of OO 



10 30 50 

AGCCGATCTCAGATTCTTCCATCTTCCAGGAGGAATTTOVGTGTGGCGACCAC^CAGCTT 

70 90 no 

GGCATTCCAACAGACGATCTAGTCGGCAATCACA 

130 150 170 

AAGAAATCACCTATGGAACTGATTAGTGAGGTTCCA 

190 210 230 

GTTGCTTGTGAAGGAGACACC^TCCGGCCCTAGGTCATCCAATCGAGTTCATATGCCTC 
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250 270 290 

GACCTAAATGAGCCTGCGATOTGCAAGTACTGCGGCCTTCGTTATGTTC^AGATCATCAC 

310 330 350 

CATTGAGGCAAATTCTGAAAGTGAACTGCTGGTCTCTCTCC^ 

370 390 410 

AGTTTGTGTATTGTTTTTTTCTGGTGTGCCTACTA 

430 450 470 

AGGATTCGATCAAAGTCGGGTAAGTTTGATTTTTGTTTGATCTCACTTCAGCACTTGTCA 

490 510 530 

tgttgtaacattcaatctctgatatcactgtctt^ 

550 

AAAAAAAAAAAAAAAA 



SEQIDNO:85 

Amino acid sequence of 00 

SerArgSerGlnlleLeuProSerSerArgArgAsnPheSerVaJLAlaThrThrGlnLeu 

10 20 

GlylleProThrAspAspLeuValGlyAsnHisThrAlaLysTrpMetGlnAspArgSer 

30 40 

LysLysSerProMetGluLeuIleSerGluValProProIleLysValAspGlyArglle 

50 60 

ValAlaCysGluGlyAspThrAsnProAlaLeuGlyHisProIleGluPhelleCysLeu 

70 80 

AspLeuAsnGluProAlalleCysLysTyrCysGlyLeuArgTyrValGlnAspHisHis 

90 100 



HisEndGlyLysPhe 

SEQIDNO:86 

Nucleic acid sequence of P 



10 30 50 

AGAACAGCTCGAGTTCCTTATGGGCCTAGACTCTCTGGTGGTGGTTACAACCGATCTGGA 



70 90 110 

AACAGGGTTCCGCGTAACAAACCAAGCTTCCCCAATAGCACCGAGTCCAATGGTGAGGCT 



WO 98/37755 

34 



66 



PCT/US98/04077 



130 150 170 

AATCAATTCAATGGCCCAAGAATAATGAACCCCCATGCT 

190 210 230 

CCTTGGGTTTCTAATGGGTATCCAGTGTCACCAAATGGCTATTTAGCATCCCCAAATGGT 

250 270 290 

GCAGAAATAACACAGAATGGGTACCCTTTGTCACCAGTAGCAGGTGGATATCCGTGTAAC 

310 330 350 

ATGTC CGTTACACAGCC TCAGGATGGACTTGTTTCAGAGGAATTACCTGGTGCTI^GAAGC 

370 390 410 

T CTGAGGAGAAGAGCGGAAGCG AAGAAGAAAGCAACAACGACAAAAATG CTGGAG AGGAT 

430 450 470 

GACGAAGCCGTTGGACAAGAAACTACAGATACACCTGAT^AATGGACATTCGACAGTAGGT 

490 510 530 

GAAGTGGAAACCACATCACATGAGACTTGTGATGAGAAAAATGGAGAACGACAAGGAGGC 

550 570 590 

AAGTGCTGGGGAGATTACAGCGATAATGAAATCGAGCAAATTGAAGTTACAAGTTGAAGA 

610 630 650 

CGCAACTGTCTGTTACTGAAGTATTAACATT^ 

670 690 710 

CTCCATTGATGAGGTTAAAGGTAAACAATCAT 

730 750 770 

GAATGTTTTGTGTTGTAGCGCAGGACCAAGGCTCGTCACTCCrrcCTTTAAC^ 

790 810 830 

CCTGCTTTCAGTTTTTGGTTTGATAGCTGAAAACTAGATATATTCAACTCCTTAATAAAA 

850 870 
GATTTGTC CCTTTGTTTAAAAAAAAAAAAAAAAAAAAAA. 



SEQ ID NO: 87 

Amino acid sequence of P 

ArgThrAlaArgValProTyrGlyProArgl^uSerGlyGlyGlyTyrAsnArgSerGly 

10 20 

AsnArgValProArgAsriLysProSerPheProAsnSerThrGluSerAsnGlyGliiAla 

30 40 

AsnGlnPheAsnGlyProArglleMetAsnProHisAlaAlaGluPhelleProSerGln 

50 60 

ProTrpValSerAsnGlyTyrProValSerProAsnGlyTyrLeuAlaSerProAsnGly 
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70 80 

AlaGluIleThrGlnAsnGlyTyrProLeuSerProVa]JUaGlyGlyTyrProCysAsn^ 

90 100 

MetSerValThrGlnProGlnAspGlyLeuValSerGluGluLeuProGlyAlaGlySer 

110 120 

S er GluGl uLysS erGlySer GluGluGluS erAsnAsnAspLysAsnAl aGlyGlxiAsp 

130 140 

AspGliiAlaValGlyGlnGluThrThrAspThrProGliiJVsnGlyHisSerThrValGly 

150 160 

GluValGluThrThrSerHisGluThrCysAspGluLysAsnGlyGluArgGlnGlyGly 

170 180 

LysCysTrpGlyAspTyrSerAspAsnGluIleGluGlnlleGluValThrSer 

190 



SEQ ID NO: 88 

Nucleic acid sequence of T 

10 30 50 

AGAGACCATCCAGCTTACCATCAGAT 

70 90 110 

CTTCAATCTTTCTGGGAGACTCAATTCAAAGA 

130 150 170 

CATAGCCTTCCATTGGCAAGAATCAAGAAAATCATGAAAGCTGATGAAGAT 

190 210 230 

ATCTCGGCCGAGGCGCCTGTTGTGTTCGCCAGGGCCTGCGAGATGTTTATTCTGGAGCTT 

250 270 290 

ACGTTAAGGTCTTGGAACCATACTGAGGAGAACAAGAGAAGGATOTTGCAGAAGAATGAT 

310 330 350 

ATCGCGGCTGCGGTGACTAGAACTGATATTTTTGATTTTCTTGTGGATAOT 

370 390 410 

GAGGATCTTCGTGATGAAGTCTTGGGTGGTGTTGGTGCTGAAGCTGCT 

430 450 470 

TATCCGTATGGATACTTGCCTGCTGGAAC^GCTCC^TTGGGAACCCGGGAATGGTTATG 

490 510 530 

GGTAACCCGGGCGCGTATCCGCCGAACCCGTATATGGGTCAGCCAATGTGGCAACAACCA 



550 570 590 

GGACCTGAGCAGCAGGATCCTGACAATTAGCTTGGCCTAATAAACTAGCCGTCTAATTCG 
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610 1 630 650 

AAGCTCTCCCCGGTGGATCTACTCAAGAA^ 

670 690 710 

AAAAAGTTTGGTGTAGTAGAATAATTTCTGTTTTATC^ 

730 750 770 

TTCAGTTTGGTTTATCrTGTCATCAAACTGTO 

790 810 830 

GAGAATTGAACTTACAAAAGGCTAGATTACTACTTATAAAG 



AAAAAAAA 



SEQIDNO:89 

Amino acid sequence of T 



ArgAspHisProAlaTyrHisGlnlleHisGlnGlnGlnGlnGlnGlnLeuThrGlnGln 

10 20 

LeuGliiSerPheTrpGluThrGlnPheLysGluIleGluLysThrThrAspPheLysAsn 

30 40 

HisSerLeuProLeuAlaArglleLysLysIleMetLysAlaAspGluAspValArgMet 

50 60 

IleSerAlaGluAlaProValValPheAlaArgAlaCysGlxiMetPhelleLeuGluIieu 

70 80 

ThrLexiArgSerTrpAsnHisThrGluGliiAsnLysArgArgThrLeuGlnLysAsnAsp 

90 ioo 

IleAlaAlaAlaValThrArgThrAspIlePheAspPheLeuValAspIleValProArg 

HO 120 

Gl\iAspLeiiArgAspGluValLeuGlyGlyValGlyAlaGliiWaAlaThrAlaAlaGly 

130 140 

TyrProTyrGlyTyrLeuProPro<31yThrAlaProIleGlyAsnProGlyMetValMet 

150 i 6 o 

GlyAsnProGlyMaTyrProProAsnProTyrMetGlyGlnProMetTrpGlnGln^ 

170 180 



GlyProGluGlnGlnAspProAspAsn 
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SEQIDNO:90 

Nucleic acid sequence of X 

10 30 50 

AGATTCGCTATTCCTGGCAAAGAAAGACAAGATTCTGTTTACAGTGGACTTCAGGAAATC 

70 90 110 

GATGTGAACTCTGAGCTTGTTTGTATCCACGACTCTGCCCGACCATTGGTGAATACTGAA 



130 150 170 

GATGTCGAGTVAGGTCCTTAAAGATGGTTCCGCGGTTGGAGCAGCTGTACrrTGGTGTTCCT 

190 210 230 

GCTAAAGCTACAATCAAAGAGGTCAATTCTGATTCGCTTGTGGTGAAAACTCTCGACAGA 

250 270 290 

AAAACCCTATGGGAAATGCAGACACCACAGGTGATCAAACCAGAGCTATTGAAAAAGGG^ 

310 330 350 

TTCGAGCTTGTAAAAAGTGAAGGTCTAGAGGTAACAGATGACGTTTCGATTGTTGAATAC 

370 390 410 

CTCAAGCATCCAGTTTATGTCTCTCAAGGATCTTATACAAACATCAAGGTTACAACACCT 

430 450 470 

GATGATTTACTGCTTGCTGAGAGAATCTTGAGCGAGGACTCATGAGATATTATATCATTT 

490 510 530 

ACTTAGTAAGAAGACGTGTCAAGGGTATGC^TGAAAAATGTTTTATTGAAATCTTTGCAT 

550 570 590 

CCTAGTTTGGTGGTTTATAAAATGTGCAAGATAATTGTTTCACTGAAAACTACTTGCTGT 

610 630 650 

GAATATGGATTCGAACAGAGCC^TTCGAAGTAGAATTTGCATATTGTAAAAAAAAAA 

670 

AAAAAAAAAA 



SEQIDNO:91 

Amino acid sequence of X 

ArgPheAlalleProGlyLysGluArgGlnAspSerValTyrSerGlyLeuGlnGluIle 

10 20 

AspValAsnSerGluLeuVal Cys I leHisAspSerAlaArgProLeuValAsnThrGlu 

30 40 

AspValGluLysValLeuLysAspGlySerAlaValGlyAlaAlaValLeuGlyValPro 

50 60 
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AlaLysAlaThrlleLysGluValAsnSerAspSerLeuValValLysThrLeuAspArg 

70 80 

LysThrLeuTrpGltiMetGlnThrProGlnVallleLysProGluLeuLeiiLysLysGly 

90 100 

PheGluIieuValLysSerGluGlyLeuGluValThrAspAspValSerlleValGluTyr 

110 120 

LeuLysHisProValTyrValSerGlnGlySerTyrThrAsnlleLysValThrThrPro 

130 140 

AspAspLeuLeuLeuAlaGluArglleLeuSerGluAspSer 

150 
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THE CLAIMS 

What is Claimed Is: 

1 . An isolated DNA sequence that encodes a LSD1 polypeptide. 

2. The isolated DNA sequence of claim 1, wherein the sequence is selected from the group 
consisting of SEQ ID N013, SEQ ID NO 14 and SEQ ID NO 15. 

3. The isolated DNA sequence of claim 1, wherein the sequence comprises SEQ ID NO 
13. 

4. The isolated DNA sequence of claim 1 , wherein the sequence comprises SEQ ID NO 
14. 

5. The isolated DNA sequence of claim 1 , wherein the sequence comprises SEQ ID NO 
15. 

6. The isolated DNA sequence of claim 1, wherein the DNA is cDNA. 

7. The isolated DNA sequence of claim 1 , wherein the DNA is genomic. 

8. The isolated DNA sequence of claim 1 , wherein the polypeptide comprises SEQ ID NO 
16. 

9. The isolated DNA sequence of claim 1 , wherein the polypeptide comprises SEQ ID NO 
17. 

10. A protein encoded by the isolated DNA sequence of claiml. 

1 1. A chimeric construction comprising a promoter sequence and a DNA sequence 
according to claim 1. 

12. A transformation vector comprising the isolated DNA sequence of claim 1. 

13. A mutated DNA sequence derived from the DNA sequence of claim 1. 
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14. A transgenic plant expressing LSD1 mutant genes that affect resistance to herbicidal 
compounds that normally result in plant cell death. 

5 15. A transgenic plant expressing LSD1 mutant genes which affect resistance to plant 
pathogens that normally result in plant cell death. 

16. A messenger RNA encoding LSD1. 

10 17. An isolated DNA sequence that encodes the zinc finger consensus selected from the 
group consisting of SEQ ID NOS 1-3. 

18. A protein containing a zinc finger protein selected from the group consisting of 
CxxCxRxxLMYxxGASxVxCxxC, CxxCRxxLMYxxGASxRxVxCxxC, 
15 CxxCxxLLMYxxGAxSxCxxC, CxxCxxLLxYxxGxxxVxCSSC, 

CSGCRNLLMYPVGATSVCCAVC, CGGCHTLIMYIRGATSVQCSCC, 
CGNCMMLLMYQYGARSVKCAVC, CGSCRRLLSYLRGSKHVKCSSC, and 
CNNCKLLLMYPYGAPAVRCSSC, wherein x is any substituted amino acid. 

20 19. A gene encoding a zinc finger protein according to claim 1 8. 

20. An isolated DNA sequence encoding a protein according to claim 1 8. 

21. A recombinant plant transformed with the DNA sequence as claimed in claim 1 . 

25 

22. A recombinant plant transformed with the DNA sequence as claimed in claim 20. 

23. An isolated DNA molecule that hybridizes under hybridization conditions to a DNA 
sequence as claimed in claim 1 . 

30 

24. An isolated DNA molecule that hybridizes under hybridization conditions to a DNA 
sequence as claimed in claim 20. 

25. An isolated DNA sequence that encodes a LSD1 homologue. 

35 

26. The isolated DNA sequence of claim 25, wherein the homologue is selected from the 
group consisting of LOL 1 and LOL2. 
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27. The isolated DNA sequence of claim 25, wherein the homologue is selected from the 
group consisting of SEQ ID NO:48, SEQ ID NO:55, SEQ ID NO:60 and SEQ ID 
NO:62. 

5 

28. The isolated DNA sequence of claim 25, wherein the sequence is selected from the 
group consisting of SEQ ID NO:47, SEQ ID NO:54, and SEQ ID NO:59. 

29. The isolated DNA sequence of claim 25, wherein the sequence comprises SEQ ID NO 
10 47. 

30. The isolated DNA sequence of claim 25, wherein the sequence comprises SEQ ID NO 
54. 

15 31. The isolated DNA sequence of claim 25, wherein the sequence comprises SEQ ID NO 
59. 

32. The isolated DNA sequence of claim 25, wherein the DNA is cDNA. 
20 33. The isolated DNA sequence of claim 25, wherein the DNA is genomic. 

34. A recombinant plant transformed with the DNA sequence as claimed in claim 25. 

35. An isolated DNA molecule that hybridizes under hybridization conditions to a DNA 
25 sequence as claimed in claim 25. 

36. A protein encoded by the isolated DNA sequence of claim 25. 

37. A chimeric construction comprising a promoter sequence and a DNA sequence 
30 according to claim 25. 

38. A transformation vector comprising the isolated DNA sequence of claim 25. 

39. A mutated DNA sequence derived from the DNA sequence of claim 25. 

35 

40. A transgenic plant expressing LOL1 mutant genes that affect resistance to herbicidal 
compounds that normally result in plant cell death. 
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41 . A transgenic plant expressing LOL1 mutant genes which affect resistance to plant 
pathogens that normally result in plant cell death. 

5 42. A messenger RNA encoding LOLl. 

43. A transgenic plant expressing LOLl mutant genes that affect resistance to herbicidal 
compounds that normally result in plant cell death. 

10 44. A transgenic plant expressing LOL2 mutant genes which affect resistance to plant 
pathogens that normally result in plant cell death. 

45. A messenger RNA encoding LOL2. 

15 46. A nucleic acid that interacts with LSD1, selected from the group consisting of the 
nucleic acid sequences set forth in SEQ ID NOS :66-91 . 

47. A protein encoded by a nucleic acid according to claim 46. 
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Fig. 7A 

LSDl 

consensus 



10 LVCHGCRNIXMYPRGASNVRCALCNTINMV 
51 HCGGCRTMLMTTRGASSVRCSCCQTTNLV 
98 I NCGHCRTTLMY PYGAS S VKC AVCQ FVTNV 

C CR LMY GAS V C C V 



Fig-7B 

LOLl 



consensus 



35 LVCSGCRNLLMYPVGATSVCCAVCNAVTAV 
7 4 LVCGGCHTLLMY IRGATSVQCSCCHTVNLA 

112 VNCGNCMMIiLMyQYGARSVKCAVCNFVTSV 

C C LLMY GA SV C G V 



Fig- 7C 

L0 L2 61 MVCGSCRRLLSYLRGSKHVKCSSCQTVNLV 

9 9 VNCNNCKLLLMYPYGAPAVRCSSCNSVTDI 



consensus 



C C I*L Y G V CSSC V 
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Fig.8A 

First zinc finger 



LSD1 LVCHGCRNLLMYPR6ASNVRCALCNTINMV 

LOL1 LVCSGCRNLLMYPVGATSVCCAVCNAVTAV 

LOL2 MVCGSCRRLLSYLRGSKHVKCSSCQTVNLV 

consensus VC CR I*L Y G V C C V 



Fig. 8B 

Second zinc finger 



LSD1 IICGGCRTMIMfTRGASSVRCSCCQTTNLV 

LOL1 LVCGGCHTLIMYIRGATSVQCSCCHTVNLA 

LOL2 VNCNNCKLLLMYPYGAPAVRCSSCNSVTDI 

consensus C C LMY GA V CS C 



Fig.8C 

Third zinc finger 

LSD1 INCGHCRTTLMYPYGASSVKCAVCQFVTNV 
L0L1 VNCGNCMMLLMYQYGARSVKCAVCNFVTSV 

consensus NCG C LMY YGA SVKCAVC FVT V 
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